OTS: Off-the-shelf—using a pre-trained model directly for inference without further training (zero-shot)
PEFT: Parameter-Efficient Fine-Tuning—adapting a large model by updating only a small subset of parameters (e.g., LoRA)
LoRA: Low-Rank Adaptation—a PEFT technique that injects trainable low-rank matrices into model layers
Linear Probing: Training a simple linear classifier on top of frozen model embeddings
ID: In-Distribution—tasks/modalities seen during the model's pretraining (e.g., radiology for a radiology-specialist model)
OOD: Out-of-Distribution—tasks/modalities not covered during pretraining (e.g., dermatology for a radiology-specialist model)
AUROC: Area Under the Receiver Operating Characteristic curve—a performance metric for classification tasks
GPT Score: A semantic evaluation metric where GPT-4 scores the quality of a VLM's generated answer against a reference
VQA: Visual Question Answering—the task of answering natural language questions about an image