SV-CoT: Structured Visual Chain-of-Thought—a reasoning framework that forces models to localize, describe, and grade abnormalities before diagnosing
ROI: Region of Interest—specific areas in a medical image (e.g., hippocampus) relevant to the diagnosis, marked by bounding boxes
mIoU: mean Intersection over Union—the area of overlap between a predicted bounding box and its ground-truth box divided by the area of their union, averaged over the evaluation set
RAG: Retrieval-Augmented Generation—enhancing model responses by retrieving relevant documents from an external knowledge base (MIRIAD in this paper)
OASIS: Open Access Series of Imaging Studies—the public MRI dataset source used for constructing S-Chain
Scheltens/Pasquier/Koedam: Standardized visual rating scales used by radiologists to grade atrophy in specific brain regions (medial temporal, global cortical, and posterior, respectively) for dementia diagnosis
SFT: Supervised Fine-Tuning—training a pre-trained model on a specific labeled dataset to adapt it to a downstream task
InfoNCE: Information Noise-Contrastive Estimation—a contrastive loss that learns representations by pulling positive pairs together and pushing negative pairs apart in embedding space
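
The two quantitative terms above, mIoU and InfoNCE, can be made concrete with a minimal NumPy sketch. It is illustrative only and not the paper's implementation: the box format `(x_min, y_min, x_max, y_max)`, the function names `box_iou`, `mean_iou`, and `info_nce`, the use of in-batch negatives, and the temperature value `0.07` are all assumptions made here.

```python
import numpy as np


def box_iou(pred, gt):
    """IoU of two axis-aligned boxes given as (x_min, y_min, x_max, y_max)."""
    # Corners of the intersection rectangle (may be empty).
    ix_min, iy_min = max(pred[0], gt[0]), max(pred[1], gt[1])
    ix_max, iy_max = min(pred[2], gt[2]), min(pred[3], gt[3])
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_pred = (pred[2] - pred[0]) * (pred[3] - pred[1])
    area_gt = (gt[2] - gt[0]) * (gt[3] - gt[1])
    union = area_pred + area_gt - inter
    return inter / union if union > 0 else 0.0


def mean_iou(preds, gts):
    """Average IoU over paired predicted and ground-truth boxes."""
    return float(np.mean([box_iou(p, g) for p, g in zip(preds, gts)]))


def info_nce(anchors, positives, temperature=0.07):
    """InfoNCE with in-batch negatives (an assumed setup).

    Row i of `positives` is the positive for row i of `anchors`;
    every other row in the batch acts as a negative.
    """
    # L2-normalize so the dot product is cosine similarity.
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)

    logits = a @ p.T / temperature
    # Subtract the row maximum for numerical stability of the softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # The matching pair sits on the diagonal of the similarity matrix.
    return float(-np.mean(np.diag(log_prob)))
```

For example, `box_iou((0, 0, 2, 2), (1, 1, 3, 3))` has an intersection of area 1 and a union of area 7, so the IoU is 1/7. The InfoNCE value is low when each anchor is most similar to its own positive and grows when the pairs are mismatched.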