GRPO: Group Relative Policy Optimization—a reinforcement learning algorithm that optimizes a policy by comparing a group of outputs against each other to estimate a baseline, reducing computational cost compared to critic-based methods
SAM2: Segment Anything Model 2—a foundation model for segmentation that generates masks from prompts like points or bounding boxes
S-Measure: Structure-measure—a segmentation metric that evaluates both region-aware and object-aware structural similarity between a predicted mask and the ground truth
IoU: Intersection over Union—a standard metric measuring the overlap between the predicted segmentation mask and the ground truth mask
COD: Camouflaged Object Detection—identifying objects that blend into their surroundings
SOD: Salient Object Detection—identifying the most visually distinctive objects in an image
FCoT: Foreground Chain-of-Thought—a new dataset introduced in this paper containing images annotated with step-by-step reasoning and SAM2 prompts
LMM: Large Multimodal Model—a model capable of processing and generating both text and image data
SFT: Supervised Fine-Tuning—training a model on labeled input-output pairs