CoT: Chain-of-Thought—a prompting technique where models generate intermediate reasoning steps before the final answer
Process Supervision: Training method that rewards the model for correct intermediate reasoning steps, not just the final outcome
SFT: Supervised Fine-Tuning—training the model on labeled input-output pairs to initialize its behavior
RL: Reinforcement Learning—training method where an agent learns to make decisions by receiving rewards or penalties
RadRScore: A custom metric evaluating reasoning based on Factuality (correctness), Completeness (coverage), and Effectiveness (relevance)
VQA: Visual Question Answering—a task where the model answers questions based on an image
Findings: The section of a radiology report describing objective observations (e.g., 'opacities observed')
Impression: The section of a radiology report providing the final diagnosis or conclusion based on the findings
RaTEscore: Automatic evaluation metric for radiology reports that assesses the quality of generated text against references