Snowball effect: The phenomenon where hallucinations progressively accumulate; an error in one sentence increases the likelihood of errors in subsequent sentences
RAG: Retrieval-Augmented Generation—providing external documents to an LLM to ground its answers
Generative Annotator: An LLM trained to output text explaining the hallucination (type, reference, correction) rather than just a classification label
Discriminative Annotator: A model trained only to classify the type of hallucination without generating corrections or references
CoSENT: A text embedding model used for sentence similarity and retrieval tasks
BM25: A ranking function used by search engines to estimate the relevance of documents to a given search query
Contradictory Hallucination: Information that conflicts with the provided reference source
Unverifiable Hallucination: Information not found in the reference source, making its truth value unknown based on available context
No Fact: Sentences containing no factual claims (e.g., chit-chat or structural transitions)