mid-to-mid: A planning approach that takes processed perception data (bounding boxes, maps) as input rather than raw sensor data (end-to-end) or just abstract goals
closed-loop: Evaluation where the ego-vehicle's actions influence future states of the environment and itself, accumulating errors over time
BEV map: Bird's Eye View map—a top-down visual representation of the driving scene including lanes, agents, and obstacles
IDM: Intelligent Driver Model—a mathematical model for simulating traffic flow and driver behavior, used here as the underlying controller
CoT: Chain-of-Thought—a prompting technique encouraging LLMs to break down reasoning into intermediate steps
lane-graph: A graph representation of road networks where lane segments are nodes and connections are edges, used to compactly represent topology
nuPlan: A large-scale dataset and benchmark for autonomous driving planning focusing on closed-loop evaluation
long-tailed scenarios: Rare, complex, or edge-case driving situations (e.g., dense construction zones, erratic pedestrians) not covered by standard rules
SDE: State Dropout Encoder—a technique used in baselines like PlanTF to improve robustness
MLLM: Multi-modal Large Language Model—an AI model capable of processing and reasoning across multiple modalities, such as text and images