RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
CoT: Chain-of-Thought—a prompting technique where the model generates intermediate reasoning steps before the final answer
External Knowledge Augmentation: Using a first pass of retrieval to generate a preliminary answer/rationale, which is then appended to the query for a second, better retrieval pass
Internal Knowledge Augmentation: Asking the LLM to generate a preliminary answer based solely on its pre-trained memory, then appending that to the query for retrieval
Exact Match (EM): A metric measuring the percentage of predictions that match the ground truth answer exactly
F1 score: A metric balancing precision and recall measuring word overlap between prediction and ground truth
Contriever: A dense retrieval model used to encode queries and documents into vectors for similarity search
Self-RAG: A baseline method that uses self-reflection tokens to critique and filter retrieved content