GHS: Globally Harmonized System—an international standard for classifying and labeling chemicals (e.g., hazard pictograms)
OSHA: Occupational Safety and Health Administration—U.S. agency setting standards for workplace safety
MLLM: Multimodal Large Language Model—AI models processing both text and visual inputs
VLA: Vision-Language-Action—models that output robot control actions directly from visual and text inputs
PRP: Perception-Reasoning-Planning—a classical cognitive architecture separating sensing, logical inference, and action generation
VQA: Visual Question Answering—tasks where a model answers questions based on an image or video
MCQ: Multiple-Choice Question—a structured query format with predefined answer options
Jaccard Index: A statistic used for gauging the similarity and diversity of sample sets (intersection over union)
Safety L23: A specific metric subset in this paper focusing on Moderate-risk (S2) and High-risk (S3) scenarios