← Back to Paper List

Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects

Yixin Cui, Haotian Lin, Shuo Yang, Yixiao Wang, Yanjun Huang, Hong Chen
School of Automotive Studies, School of Physics Science and Engineering, College of Electronics and Information Engineering, Clean Energy Automotive Engineering Center
arXiv.org (2025)
MM Reasoning Agent Benchmark

📝 Paper Summary

Autonomous Driving (AD) Large Language Models (LLMs) Survey / Literature Review
This survey systematically categorizes Chain-of-Thought applications in autonomous driving, proposing a 'Thought Transition' formalism to enhance system interpretability and reasoning in complex scenarios.
Core Problem
Current autonomous driving paradigms (rule-based and data-driven end-to-end) struggle with deep reasoning, interpretability, and generalization in complex, dynamic, or long-tail traffic scenarios.
Why it matters:
  • Rule-driven systems lack flexibility in dynamic environments, while data-driven black-box models suffer from poor interpretability and data dependency
  • LLMs excel at responsiveness but often fail at deep reasoning required for complex driving tasks without structured guidance
  • There is a lack of comprehensive reviews specifically focusing on how Chain-of-Thought technology advances autonomous driving distinct from general LLM applications
Concrete Example: In a complex intersection, a standard end-to-end model might output a 'stop' command without understanding why. A CoT-enabled system would reason: 'Pedestrian detected -> Projected path intersects ego vehicle -> Risk high -> Decision: Stop', providing transparency and better handling of the corner case.
Key Novelty
Systematic Survey of CoT in Autonomous Driving
  • Formalizes the 'Thought Transition' process for driving, modeling reasoning as a recursive sequence of steps (thoughts) and intermediate states rather than a direct input-output mapping
  • Categorizes existing research into modular applications (Perception, Prediction, Planning) and End-to-End frameworks, identifying 'Logical' vs. 'Reflective' reasoning patterns
  • Proposes combining CoT with self-learning mechanisms to enable 'self-evolution' in autonomous systems, moving towards knowledge-driven driving
Evaluation Highlights
  • Categorizes over 30 recent approaches (2023-2025) including DriveVLM, DiLu, and Agent-Driver based on their pipeline (Modular vs. End-to-End) and cognitive process
  • Identifies three evolutionary stages of AD paradigms: Rule-driven -> Data-driven -> Knowledge-driven (current focus)
  • Establishes a dynamic repository 'Awesome-CoT4AD' to track forefront developments in the field
Breakthrough Assessment
7/10
While a survey/review paper (no new SOTA model), it provides a critical definition of the emerging 'Knowledge-Driven' paradigm and formalizes the CoT theoretical framework for the field.
×