VLA: Vision-Language-Action model—an AI system that processes visual and text inputs to generate both linguistic reasoning and physical actions
Chain of Causation (CoC): A structured reasoning format proposed in this paper that explicitly links observed scene evidence to driving decisions
Flow Matching: A generative modeling technique (related to diffusion models) used here to generate continuous, smooth vehicle trajectories
RLHF: Reinforcement Learning from Human Feedback—fine-tuning models based on rewards derived from human preferences or verifiable outcomes
Open-loop evaluation: Testing a driving model on pre-recorded data to see if it predicts the expert's path (without the vehicle actually moving/reacting)
Closed-loop evaluation: Testing a driving model in a simulator where the vehicle's actions affect future observations and states
SFT: Supervised Fine-Tuning—training the model on labeled examples of reasoning and driving actions
Long-tail scenarios: Rare, edge-case driving situations (e.g., construction zones, debris) that occur infrequently but are critical for safety