← Back to Paper List

A comprehensive taxonomy of hallucinations in Large Language Models

Manuel Cossio
Universitat de Barcelona
arXiv.org (2025)
Factuality Benchmark MM Reasoning

📝 Paper Summary

Hallucination Taxonomy Theoretical limits of LLMs Hallucination Mitigation
This report establishes a formal taxonomy for LLM hallucinations, proving their theoretical inevitability in computable models while categorizing their manifestations, causes, and mitigation strategies.
Core Problem
LLMs frequently generate plausible but factually incorrect or fabricated content, posing risks to reliability in critical applications like healthcare and law.
Why it matters:
  • Hallucinations in safety-critical domains (medical, legal) can lead to severe consequences, such as misinforming patients or citing non-existent court cases
  • Current definitions are inconsistent (e.g., intrinsic vs. extrinsic), hindering comparative research and the development of unified mitigation strategies
  • Users often over-rely on confident but incorrect AI outputs due to cognitive biases like automation bias and the fluency heuristic
Concrete Example: An LLM summarizing an article stating the FDA approved an Ebola vaccine in 2019 might intrinsically hallucinate by claiming the FDA rejected it, or extrinsically hallucinate by inventing a claim that the Parisian Tiger was hunted to extinction in 1885.
Key Novelty
Theoretical Inevitability of Hallucination
  • Posits that hallucination is not merely a bug but an innate limitation of computable LLMs, proven via diagonalization in computability theory
  • Demonstrates that for any computable LLM, there exists a ground truth function where the model will hallucinate on an infinite number of inputs
  • Argues that self-elimination of hallucination is impossible, necessitating external aids like RAG or human oversight
Evaluation Highlights
  • Identifies that logical inconsistencies account for 19% of hallucination cases in surveyed analyses
  • Notes that temporal disorientation (errors with time-sensitive info) accounts for 12% of identified hallucination cases
  • Reports that ethical violations (harmful/defamatory content) represent 6% of hallucination cases
Breakthrough Assessment
7/10
While primarily a survey and taxonomy, the paper significantly strengthens the theoretical foundation by formalizing the 'inevitability' argument based on computability theory, shifting the field's focus from elimination to mitigation.
×