computable LLM: An LLM defined as a computable function within a formal mathematical framework, subject to the limits of computability theory
diagonalization: A mathematical proof technique used to show that certain sets are larger than others, used here to prove there are always inputs where the LLM fails
intrinsic hallucination: Generated content that contradicts the provided input context (e.g., a summary contradicting the source text)
extrinsic hallucination: Generated content that cannot be verified from the source text and contradicts real-world knowledge or training data
RAG: Retrieval-Augmented Generation—a technique that grounds LLM responses in external documents to reduce hallucinations
fluency heuristic: A cognitive bias where users judge the accuracy of information based on how grammatically correct and smooth the text appears
automation bias: The tendency for humans to over-rely on automated systems and accept their outputs as correct, even when they are not
Toolformer: An architectural approach where LLMs are trained to use external tools (calculators, APIs) to improve accuracy
FActScore: A benchmark metric that evaluates factual consistency in summarization by breaking sentences into atomic facts
HalluLens: A benchmark systematically mapping hallucinations to a taxonomy including factual, ethical, logical, and temporal dimensions
TruthfulQA: A benchmark of adversarially constructed questions designed to test whether models mimic human falsehoods or misconceptions