Iterative Retrieval: A process where an LLM repeatedly queries a retriever to gather information in steps rather than all at once
RAG: Retrieval-Augmented Generation—providing an LLM with external documents to ground its answers
Parametric Knowledge: Knowledge stored within the LLM's weights during pre-training, as opposed to external knowledge from retrieval
Instruction Synthesis: Using a powerful LLM to generate training examples (input-output pairs) to teach a smaller model a specific task
SFT: Supervised Fine-Tuning—training a model on labeled examples to follow specific instructions
Retriever: A system (usually a dense vector model) that finds relevant documents from a large corpus given a query
F1 score: A metric measuring the overlap between the predicted answer and the ground truth, balancing precision and recall
EM: Exact Match—a strict metric requiring the predicted answer to be identical to the ground truth
Self-RAG: A baseline method that trains an LLM to generate special reflection tokens to control retrieval and critique generation
FLARE: Forward-Looking Active Retrieval Augmented Generation—a baseline that triggers retrieval when the model generates low-confidence tokens
ITER-RETGEN: A baseline method that concatenates previous generations to retrieve information for the next generation step
CoT: Chain-of-Thought—a prompting technique encouraging the model to generate intermediate reasoning steps
Dense Retrieval: Retrieving documents based on semantic similarity of vector embeddings rather than keyword matching