RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Prober: A small neural network classifier attached to intermediate layers of a large model to diagnose its internal state or knowledge
Hidden States: The intermediate numerical representations of input text as it passes through the layers of a Transformer model
CoT: Chain-of-Thought—a prompting technique where the model generates intermediate reasoning steps before the final answer
Parametric Knowledge: Information stored directly in the model's weights during pre-training, as opposed to information found in external documents
Knowledge Conflicts: Situations where the model's internal knowledge contradicts the information found in retrieved documents
Adaptive Retrieval: RAG systems that dynamically decide when, what, or how often to retrieve based on query complexity or model uncertainty
FLARE: Forward-Looking Active REtrieval—a method that triggers retrieval when generated tokens have low probability
EM: Exact Match—a metric checking if the generated answer is character-for-character identical to the ground truth