Factuality: The probability of LLMs producing content consistent with established facts (grounded in reliable sources)
Hallucination: The tendency of models to produce content that is nonsensical or untruthful in relation to sources, or unfaithful to the prompt (even if factually correct)
Snowballing: An inference-level error where an initial incorrect generation leads the model to produce further consistent but incorrect information
Exposure Bias: A discrepancy between training (ground truth available) and inference (model relies on own predictions), potentially leading to error propagation
RAG: Retrieval-Augmented Generation—systems that retrieve external documents to ground generation
SFT: Supervised Fine-Tuning—training on labeled examples to align the model
CoT: Chain-of-Thought—a prompting strategy where the model generates intermediate reasoning steps
Model Editing: Techniques to directly update specific facts within the model's parameters without full re-training
Standalone LLMs: LLMs that rely solely on their internal parametric knowledge without external retrieval
MMLU: Massive Multitask Language Understanding—a benchmark covering 57 subjects to test world knowledge
TruthfulQA: A benchmark specifically designed to test whether models mimic human falsehoods or generate truthful answers