DPO: Direct Preference Optimization—a method to fine-tune models on preference pairs (winner/loser) by optimizing a classification loss, avoiding a separate reward model
VLLM: Vision Large Language Model—a multimodal model combining a visual encoder with a large language model to process images and text
CHAIR: Captioning Hallucination Assessment with Image Relevance—a metric evaluating the accuracy of object descriptions by comparing captioned objects to ground truth
POPE: Polling on Object Existence—a benchmark using binary Yes/No questions to test if a model hallucinates non-existent objects
diffusion noise: Gaussian noise added to an image, used here to disrupt visual features and trigger the model's reliance on language priors (hallucinations)
RLHF: Reinforcement Learning from Human Feedback—training method using human preferences to guide model behavior
hallucination: Generations where the model produces content not grounded in the input image, often based on language priors or spurious correlations
object co-occurrence: The statistical tendency of certain objects (e.g., knife and fork) to appear together, which can mislead models to hallucinate one when only the other is present