AuthenHallu: The proposed benchmark dataset constructed from real-world LLM-human dialogues
LMSYS-Chat-1M: A large-scale dataset of 1 million real-world conversations between humans and various LLMs
Hallucination: LLM outputs that are incorrect or inconsistent with the context or user input
Deliberately Induced Generation: A data creation strategy where models are explicitly prompted to generate incorrect information (e.g., HaluEval)
Simulated Interactive Generation: A strategy where queries are collected/crafted and responses generated, but interactions are not from real users (e.g., FELM)
Fact-conflicting: A hallucination category where the output contradicts established world knowledge
Input-conflicting: A hallucination category where the output contradicts the user's explicit input prompt
Context-conflicting: A hallucination category where the output contradicts previous turns in the dialogue history
Vanilla LLM: Using a standard, off-the-shelf Large Language Model without additional fine-tuning or external tools
IAA (Inter-Annotator Agreement): A statistical measure (like Fleiss's Kappa) evaluating how consistently different human annotators assign labels