Attestation Bias (Λ): The tendency of an LLM to affirm entailment simply because the hypothesis sentence is attested (memorized) from its training data, regardless of the premise
Relative Frequency Bias (Φ): The tendency of an LLM to affirm entailment if the premise's predicate is less frequent in the corpus than the hypothesis's predicate, mimicking a specific-to-general relationship
NLI: Natural Language Inference—determining whether a hypothesis is true given a premise
Levy/Holt: A specific NLI dataset focused on directional entailment between predicates (e.g., 'murder' entails 'kill' but 'kill' does not entail 'murder')
RTE-1: Recognizing Textual Entailment 1—a classic, difficult NLI dataset
AUC norm: Area Under the Curve normalized so that 0% represents random chance performance and 100% represents perfect classification
Hallucination: In this context, specifically false positive entailments where the model claims a logical relationship exists when it does not
Directional Entailment: Relationships that hold in one direction but not both (e.g., 'buy' entails 'own', but 'own' does not entail 'buy')
zero-shot: Asking the model to perform a task without providing any examples in the prompt
few-shot: Providing the model with a small number of examples (here, 4) in the prompt before the target query
entity linker: A tool that identifies named entities in text and links them to a knowledge base (here, Freebase IDs)
FIGER types: A set of fine-grained entity types (e.g., 'person', 'location') used to categorize named entities