VIB: Variational Information Bottleneck—a method to learn a compressed representation that keeps only the information relevant to a target task
POPE: Polling for Object Hallucination Evaluation—a benchmark asking Yes/No questions about objects in an image to test VLM factuality
CHAIR: Captioning Hallucination Assessment with Image Relevance—a metric measuring the proportion of generated objects not present in the ground truth
AUROC: Area Under the Receiver Operating Characteristic curve—a performance metric for classification problems at various threshold settings
AUPRC: Area Under the Precision-Recall Curve—a metric focusing on positive class performance, useful for imbalanced datasets
KL divergence: Kullback-Leibler divergence—a statistical distance measuring how one probability distribution differs from a second, reference distribution
logit: The raw, unnormalized prediction vector generated by a neural network before the final activation function (like softmax)
attention heads: Components in Transformer models that allow the model to focus on different parts of the input sequence simultaneously