SAM: Sharpness-Aware Minimization—an optimization technique that seeks parameters in a flat region of the loss landscape to improve generalization and robustness
Relearning Attack: An adversarial evaluation method where an unlearned model is fine-tuned on a small amount of the original 'forgotten' data to see if the erased behavior resurfaces
Chair: Captioning Hallucination Assessment with Image Relevance—a metric for quantifying object hallucinations in image captioning (Chair_S = sentence level, Chair_I = image level)
POPE: Polling-based Object Probing Evaluation—a method to evaluate object hallucination by asking yes/no questions about the existence of objects in an image
EFUF: Efficient Fine-grained Unlearning Framework—a baseline unlearning method for MLLMs that uses negative and positive subsentences but lacks sharpness-aware optimization
LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning technique that freezes pre-trained weights and injects trainable rank decomposition matrices
MHumanEval: A human-verified evaluation benchmark for assessing hallucinations in multimodal outputs
Targeted-SAM: The paper's specific adaptation of SAM where the inner maximization targets the hallucination loss specifically to find the worst-case relapse direction
CLIP: Contrastive Language-Image Pre-training—a model used here to score the alignment between image regions and text segments to detect hallucinations automatically