← Back to Paper List

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Lei Huang, Weijiang Yu, Weitao Ma, Weihong Zhong, Zhangyin Feng, Haotian Wang, Qianglong Chen, Weihua Peng, Xiaocheng Feng, Bing Qin, Ting Liu
Harbin Institute of Technology, Huawei Inc.
arXiv (2023)
Factuality Benchmark RAG RL Pretraining

📝 Paper Summary

Hallucination Taxonomy Hallucination Causes Hallucination Detection Hallucination Mitigation
This survey redefines LLM hallucinations into factuality and faithfulness categories, traces their causes through data, training, and inference stages, and systematically reviews detection and mitigation strategies.
Core Problem
LLMs prone to generating plausible yet non-factual content (hallucinations) pose significant risks to reliability in real-world applications like search engines and medical advice.
Why it matters:
  • Misleading information from widely used systems (chatbots, search) can spread false beliefs or cause harm in decision-making
  • Traditional NLG hallucination definitions (intrinsic/extrinsic) are insufficient for open-ended LLMs, which require a broader scope covering factual errors and user-instruction misalignment
  • The convincing, human-like nature of LLM responses makes detecting these errors particularly challenging for users
Concrete Example: When asked about the Eiffel Tower's environmental impact, an LLM might fabricate a claim that it 'led to the extinction of the Parisian tiger'—a species that never existed. This is a Factual Fabrication (Unverifiability) hallucination.
Key Novelty
Comprehensive LLM-Specific Hallucination Taxonomy and Cause Analysis
  • Proposes a new taxonomy splitting hallucinations into 'Factuality' (conflict with real-world facts) and 'Faithfulness' (conflict with user instructions, context, or self-consistency)
  • Analyzes causes across three stages: Data (misinformation, biases), Training (pre-training flaws, misalignment), and Inference (decoding strategies, softmax bottlenecks)
  • Connects specific mitigation strategies (like RAG or feedback-based editing) directly to the identified causes in a structured framework
Evaluation Highlights
  • Surveys benchmarks like TruthfulQA, HalluQA, and HaluEval-2.0 to quantify hallucination rates
  • Highlights that detection methods include both factuality checks (using external knowledge) and faithfulness checks (checking consistency with context/instructions)
  • Discusses mitigation success in specific areas, such as RAG (Retrieval-Augmented Generation) reducing factual errors, though noting RAG itself can suffer from hallucinations
Breakthrough Assessment
9/10
A foundational survey that establishes the standard taxonomy for LLM hallucinations. It structures a chaotic field into clear categories (Factuality vs. Faithfulness) and causes, guiding future research effectively.
×