atomic unit: The smallest verifiable piece of information in a generation, such as a single sentence in a summary or a specific package import in code
Type A error: Hallucination where the correct fact was present in the pretraining data (failed recall)
Type B error: Hallucination where the incorrect fact (or fact taken out of context) was present in the pretraining data
Type C error: Hallucination where neither correct nor incorrect facts were in the pretraining data (pure fabrication)
Response-Based task: A task where the model is expected to provide a helpful answer (e.g., summarization, code generation)
Refusal-Based task: A task where the model is expected to abstain from answering because the premise is false or impossible (e.g., 'historically famous meeting between people who never met')
Utility Score: A metric that rewards factual answers for response-based tasks and rewards silence/refusal for refusal-based tasks
Response Ratio: The proportion of prompts for which the model attempts to generate an answer rather than refusing
WIMBD: A tool/index for searching large pretraining corpora to attribute model generations to training data
intrinsic hallucination: Errors where the model misinterprets information provided in the input context
extrinsic hallucination: Errors where the model introduces new, external information not found in the input context