hallucination: A response containing information that cannot be fully verified from the source knowledge snippet (even if true in the real world).
entailment: A response fully supported by the knowledge snippet; all information is attributable to the source.
VRM: Verbal Response Modes—a taxonomy for classifying speech acts (e.g., Disclosure, Edification, Advisement).
BEGIN: Benchmark for Evaluation of Grounded Interaction—a taxonomy used to classify response groundedness (Entailment, Hallucination, Generic, Uncooperative).
disclosure: A VRM category where the speaker reveals subjective opinions, thoughts, feelings, or personal experiences.
edification: A VRM category where the speaker states objective information, such as facts, rather than personal opinions or experiences.
uncooperative: A response that is entailed by the source but does not follow conversational principles (e.g., it is incoherent with the conversation history or purely extractive).
nucleus sampling: A decoding strategy that samples from the smallest set of top vocabulary tokens whose cumulative probability exceeds a threshold p.
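The nucleus sampling definition can be illustrated with a minimal sketch. It assumes the next-token distribution is given as a plain dictionary mapping tokens to probabilities; the function names `nucleus_filter` and `nucleus_sample` are hypothetical and chosen for illustration. The boundary convention (stopping once the cumulative probability reaches p, using `>=`) is also an assumption.

```python
import random


def nucleus_filter(probs, p):
    """Keep the smallest set of top tokens whose cumulative probability
    reaches p, then renormalize so the kept probabilities sum to 1.

    `probs` is assumed to be a dict of token -> probability.
    """
    # Rank tokens from most to least probable.
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)

    nucleus = []
    cumulative = 0.0
    for token, prob in ranked:
        nucleus.append((token, prob))
        cumulative += prob
        if cumulative >= p:  # assumed boundary convention
            break

    total = sum(prob for _, prob in nucleus)
    return {token: prob / total for token, prob in nucleus}


def nucleus_sample(probs, p, rng=random):
    """Draw one token from the renormalized nucleus."""
    nucleus = nucleus_filter(probs, p)
    tokens = list(nucleus)
    weights = [nucleus[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]


if __name__ == "__main__":
    # Toy distribution, invented for illustration only.
    toy = {"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}
    print(nucleus_filter(toy, 0.9))
    print(nucleus_sample(toy, 0.9))
```

With a small p, the nucleus shrinks toward the single most likely token and decoding becomes nearly greedy; with p close to 1, almost the whole vocabulary remains eligible.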