Chain-of-Thought (CoT): A reasoning process where complex decisions are broken down into intermediate logical steps (e.g., Hazard Detection -> Rule Compliance -> Action)
End-to-End Driving: A system that maps raw sensor data directly to driving controls or trajectories, bypassing traditional modular pipelines
CARLA: An open-source simulator for autonomous driving research used to generate synthetic training data and evaluate agents
Waypoints: A sequence of 2D coordinates representing the planned future path of the vehicle
Video-SwinTransformer: A vision transformer architecture adapted for video input that uses 3D shifted windows to model temporal and spatial relationships
PID Controller: Proportional-Integral-Derivative controller—a control loop mechanism used to convert high-level plans (speed/waypoints) into low-level actuation (throttle/steer)
Open-loop evaluation: Testing a model's predictions against a pre-recorded dataset without the model's actions affecting the future states
Closed-loop evaluation: Testing a model in a simulator where its actions dynamically influence the environment and future states