HHEM-2.1-Open: Vectara's open-source hallucination detection model used as a filter for selecting challenging samples
TrueTeacher: A Google model used to generate synthetic training data or labels for hallucination detection
True-NLI: A Natural Language Inference model used to check if a summary is entailed by the source document
LLM-as-a-judge: Using a strong LLM (such as GPT-4) to evaluate the outputs of other models
Benign hallucination: Information not in the source text but supported by world knowledge or reasoning, considered acceptable by readers
Intrinsic hallucination: Generated content that explicitly contradicts the source passage
Extrinsic hallucination: Generated content that is not supported by the passage, cannot be inferred from it, and is not factually correct
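Krippendorff’s alpha: A chance-corrected statistical measure of agreement among annotators who code (label) the same set of units; 1 indicates perfect agreement and 0 indicates agreement no better than chance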
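
As an illustration of the last entry, for nominal labels Krippendorff's alpha is 1 - D_o / D_e, where D_o is the observed disagreement and D_e is the disagreement expected by chance. The sketch below is a minimal implementation under stated assumptions: the function name, the input layout (one list of labels per annotated unit, with None for a missing rating), and the "faithful" / "hallucinated" labels are all hypothetical choices made for this example, not taken from the source.

```python
from collections import Counter
from itertools import permutations


def krippendorff_alpha_nominal(units):
    """Krippendorff's alpha for nominal labels.

    units: one list of labels per annotated unit; None marks a missing rating.
    (Hypothetical input layout chosen for this sketch.)
    """
    # Coincidence counts: every ordered pair of ratings within a unit
    # contributes 1 / (m - 1), where m is the number of ratings in that unit.
    coincidences = Counter()
    for ratings in units:
        values = [r for r in ratings if r is not None]
        m = len(values)
        if m < 2:
            continue  # a unit with fewer than two ratings cannot be paired
        for a, b in permutations(values, 2):
            coincidences[(a, b)] += 1.0 / (m - 1)

    # Marginal totals per label, and the total number of pairable ratings.
    totals = Counter()
    for (a, _), weight in coincidences.items():
        totals[a] += weight
    n = sum(totals.values())
    if n <= 1:
        raise ValueError("need at least one unit with two or more ratings")

    # Both quantities below omit a common factor of 1 / n, which cancels
    # in the ratio D_o / D_e.
    observed = sum(w for (a, b), w in coincidences.items() if a != b)
    expected = sum(
        totals[a] * totals[b] for a in totals for b in totals if a != b
    ) / (n - 1)
    if expected == 0:
        raise ValueError("alpha is undefined when only one label is used")
    return 1.0 - observed / expected


# Two hypothetical annotators labelling four summaries.
example = [
    ["faithful", "faithful"],
    ["faithful", "hallucinated"],
    ["hallucinated", "hallucinated"],
    ["hallucinated", "hallucinated"],
]
alpha = krippendorff_alpha_nominal(example)
```

Negative values are possible and indicate systematic disagreement, that is, agreement below what chance alone would produce. Ordinal or interval labels need a different distance function than the simple match / mismatch used here.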