← Back to Paper List

Hallucinations are inevitable but can be made statistically negligible. The "innate" inevitability of hallucinations cannot explain practical LLM issues

Atsushi Suzuki, Yulan He, Feng Tian, Zhongyuan Wang
The University of Hong Kong, King's College London, The Alan Turing Institute, United Kingdom, Duke Kunshan University, China, Wuhan University, China
arXiv (2025)
Factuality Reasoning

📝 Paper Summary

Theoretical limits of LLMs Statistical learning theory for LLMs
While computability theory proves hallucinations occur on infinite inputs, statistical learning theory shows their probability can be reduced to near-zero given sufficient training data.
Core Problem
Recent computability-theoretic results claim hallucinations are 'inevitable' for any LLM regardless of data or architecture, leading to the pessimistic belief that they are an unsolvable fundamental limitation.
Why it matters:
  • Misinterpretation of theoretical limits discourages practical efforts to eliminate hallucinations in critical applications
  • Popular media and researchers cite diagonal-argument proofs to claim hallucinations 'cannot be stopped', creating a fatalistic view of LLM reliability
  • Practitioners need to know if failures are due to fundamental mathematical limits or just insufficient data/algorithms
Concrete Example: A diagonal argument might prove an LLM must fail on a specific adversarial input like a Gödel sentence. However, if that input occurs with probability 10^-100 in real usage, the system is practically reliable, contradicting the 'inevitability' claim's practical implication.
Key Novelty
Statistical Negligibility of Inevitable Hallucinations
  • Demonstrates that an infinite set of failure cases (proven by computability theory) can still have an arbitrarily small total probability measure
  • Applies Shannon's source coding theorem as an analogy: errors may be theoretically inevitable on some inputs but statistically negligible in practice
  • Proves existence of a Language Model Trainer (LMT) that achieves near-zero hallucination risk given sufficient qualified training data
Evaluation Highlights
  • Proves hallucinations are uniformly statistically negligible on the set of probability measures with a lower-bounded input length CDF
  • Proves hallucinations are non-uniformly statistically negligible on the set of all probability measures on the input space
  • Establishes mathematically that computability-theoretic 'inevitability' coexists with probability-theoretic 'negligibility'
Breakthrough Assessment
7/10
Theoretical paper that clarifies a major misconception in the field. While it doesn't propose a new architecture, it provides the mathematical grounding to refute the 'hallucinations are unsolvable' narrative.
×