RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Self-RAG: A framework where a model learns to retrieve, generate, and critique its own output using special reflection tokens
CRAG: Corrective Retrieval Augmented Generation—the proposed method that evaluates and corrects retrieval results
FactScore: An evaluation metric that breaks down a generation into atomic facts and verifies how many are supported by a knowledge source (e.g., Wikipedia)
T5-large: A text-to-text transfer transformer model (~770M parameters) used here as the lightweight retrieval evaluator
Contriever: A dense information retrieval model used to fetch relevant documents from a corpus
PopQA: A dataset for short-form entity generation tasks
Arc-Challenge: A multiple-choice question dataset requiring reasoning
PubHealth: A true-or-false question dataset focused on public health claims
Decompose-then-recompose: A method to split documents into smaller strips, evaluate each strip's relevance, and reconstruct a clean context by concatenating only the relevant parts