VFM: Vision Foundation Models—large-scale pre-trained vision models like ViT and SAM
VLP: Vision-Language Pre-training models—models trained on image-text pairs to learn aligned representations (e.g., CLIP)
VLM: Vision-Language Models—models capable of processing and generating both images and text (e.g., GPT-4V)
DM: Diffusion Models—generative models that create data (images/audio) by reversing a noise addition process
Jailbreak: Attacks that bypass a model's safety guardrails (alignment) to elicit prohibited or harmful content
Prompt Injection: Attacks where malicious instructions are inserted into the input context to hijack the model's intended task
Backdoor: A hidden vulnerability injected during training that causes the model to behave maliciously only when a specific 'trigger' is present in the input
Adversarial Attack: Subtle, often imperceptible perturbations to input data designed to cause model error
Indirect Prompt Injection: An attack on agents where the malicious prompt is hidden in external data (e.g., a webpage) that the agent retrieves, rather than being typed by the user
Agentic AI: LLM-based systems that can autonomously plan, reason, and use tools to accomplish complex tasks