Evident Conflict: Generated content directly contradicts provided information (e.g., wrong numbers, factual errors)
Subtle Conflict: Generated content diverges from provided information by altering intended meaning or severity without direct negation
Evident Baseless Info: Generated content includes fabricated details completely absent from the source
Subtle Baseless Info: Generated content adds unverifiable inferred details, sentiments, or subjective assumptions
SelfCheckGPT: A zero-resource hallucination detection method (it needs no external knowledge source) that samples multiple responses from the same model and flags statements that are inconsistent across those samples
RAG: Retrieval-Augmented Generation. AI systems that answer questions by first retrieving relevant documents and then generating a response conditioned on them
SFT: Supervised Fine-Tuning. Further training of a pretrained model on labeled input-output examples
NLI: Natural Language Inference. Determining whether a hypothesis follows from a premise (entailment), contradicts it (contradiction), or is neither supported nor contradicted by it (neutral)
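
As a rough illustration of the sampling-based consistency idea behind SelfCheckGPT, the sketch below scores each sentence of a main response by how poorly it is supported by other sampled responses. It is not the method's actual scoring: the token-overlap `support` function is a crude stand-in chosen to keep the example self-contained, and every name and the toy data are hypothetical.

```python
import re


def _tokens(text):
    """Lowercase word tokens; a crude stand-in for a real NLI or QA-based scorer."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def support(sentence, sample):
    """Fraction of the sentence's tokens that also appear in the sampled response."""
    sent = _tokens(sentence)
    if not sent:
        return 1.0  # an empty sentence makes no claim, so treat it as supported
    return len(sent & _tokens(sample)) / len(sent)


def inconsistency_scores(sentences, samples):
    """Score each sentence in [0, 1]; higher means less supported by the samples."""
    if not samples:
        raise ValueError("need at least one sampled response")
    scores = []
    for sentence in sentences:
        mean_support = sum(support(sentence, s) for s in samples) / len(samples)
        scores.append(1.0 - mean_support)
    return scores


# Hypothetical toy data: the second sentence is an invented, unsupported claim.
main_response = ["Paris is the capital of France.", "It was founded in 1850."]
sampled = [
    "The capital of France is Paris.",
    "Paris is the capital of France.",
    "France has Paris as its capital.",
]
scores = inconsistency_scores(main_response, sampled)
```

In this toy run the second sentence receives the higher score because none of the sampled responses repeat its claim. A practical detector would likely swap the overlap heuristic for a stronger scorer, for instance an NLI model that checks whether each sample entails or contradicts the sentence.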