VLM: Vision-Language Model—an AI model capable of understanding and generating text based on visual inputs
Bradley-Terry Model: A statistical model used to estimate the relative skill or strength of competitors based on the outcomes of paired comparisons
Chatbot Arena: An open-source platform where users chat with anonymous models side-by-side and vote on which response they prefer
OCR: Optical Character Recognition—the conversion of images of typed, handwritten, or printed text into machine-encoded text
NER: Named Entity Recognition—identifying and classifying key information (entities) in text into predefined categories
CSAM: Child Sexual Abuse Material—harmful content that is automatically filtered out of the dataset
NSFW: Not Safe For Work—content containing nudity, violence, or other sensitive material
PII: Personally Identifiable Information—data that could identify a specific individual
MMMU: A massive multi-discipline multimodal understanding and reasoning benchmark
HallusionBench: A benchmark designed to diagnose visual illusions and hallucinations in VLMs