Steering Vector: A direction in the model's activation space that, when added to hidden states, biases the model's behavior (e.g., towards creativity or factuality)
CHAIR: Captioning Hallucination Assessment with Image Relevance, a metric measuring the percentage of objects mentioned in generated captions that are not present in the image
POPE: Polling-based Object Probing Evaluation—a benchmark testing whether a model answers 'yes' or 'no' correctly regarding object existence
VDAT: Visual Divergent Association Test—a new benchmark proposed in this paper to measure associative reasoning by asking models to generate nouns unrelated to an image
VCD: Visual Contrastive Decoding—a baseline method that reduces hallucination by contrasting logits from original and distorted visual inputs
Ha-DPO: Hallucination-Aware Direct Preference Optimization, a training-based baseline that aligns models to prefer grounded responses over hallucinated ones
Associative Reasoning: The cognitive process of connecting ideas; 'convergent' association supports factual, faithful responses, while 'divergent' association supports creativity
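Cosine Distance: One minus the cosine similarity of two vectors; it measures the difference in their direction and ignores their magnitude
Euclidean Distance: The straight-line distance between two vectors (the norm of their difference); it reflects differences in both direction and magnitude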
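
As an illustration of the Steering Vector, Cosine Distance, and Euclidean Distance entries, the following is a minimal sketch in plain Python. The function names, the scaling factor alpha, and the toy vectors are assumptions made for this example; they are not taken from the paper and do not describe its implementation.

```python
import math

def apply_steering(hidden_states, direction, alpha=1.0):
    """Add a scaled steering direction to each hidden state (one per token).

    `alpha` is a hypothetical strength parameter for this sketch.
    """
    return [[h + alpha * d for h, d in zip(state, direction)]
            for state in hidden_states]

def cosine_distance(u, v):
    """1 - cosine similarity: depends on direction only, not magnitude."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def euclidean_distance(u, v):
    """Norm of the difference: depends on both direction and magnitude."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

# Toy vectors: u and v point the same way but differ in length.
u = [1.0, 0.0]
v = [3.0, 0.0]
print(cosine_distance(u, v))     # same direction
print(euclidean_distance(u, v))  # different magnitude

# Toy hidden states for two tokens, nudged along a made-up direction.
steered = apply_steering([[0.0, 0.0], [1.0, 1.0]], [0.0, 1.0], alpha=0.5)
print(steered)
```

With these toy inputs, the two vectors share a direction, so their cosine distance is zero while their Euclidean distance is not, which is the contrast the two glossary entries describe.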