← Back to Paper List

HeartAgent: An Autonomous Agent System for Explainable Differential Diagnosis in Cardiology

Shuang Zhou, Kai Yu, Song Wang, Wenya Xie, Zaifu Zhan, Meng-Han Tsai, Yuen-Hei Chung, Shutong Hou, Huixue Zhou, Min Zeng, Bhavadharini Ramu, Lin Yee Chen, Feng Xie, Rui Zhang
University of Minnesota, University of Central Florida, University of Colorado Anschutz Medical Campus, University of California San Francisco
arXiv (2026)
Agent MM RAG Factuality Reasoning

📝 Paper Summary

Medical Agentic AI Clinical Decision Support Differential Diagnosis
HeartAgent is a multi-agent system that autonomously orchestrates specialized agents and cardiology-specific tools to generate verified differential diagnoses and explanations from multimodal clinical data.
Core Problem
General-purpose diagnostic AI often lacks deep cardiology knowledge, struggles with complex reasoning over overlapping symptoms, and operates as a 'black box' without verifiable evidence.
Why it matters:
  • Heart diseases are a leading cause of death requiring precise differentiation between conditions with similar presentations (e.g., aortic dissection vs. myocardial infarction)
  • Trustworthy clinical AI requires transparency and adherence to guidelines, not just prediction accuracy
  • Existing models fail to integrate heterogeneous data (notes, ECG, imaging) with external medical knowledge effectively
Concrete Example: A patient with chest pain and diaphoresis might be misdiagnosed by a standard model that misses subtle distinctions. HeartAgent's reviewers might catch that 'estimated valve area 0.6 cm²' implies valvular stenosis (not just general heart disease) by cross-referencing specific guidelines.
Key Novelty
Collaborative Multi-Agent Cardiology Framework
  • Decomposes diagnosis into collaborative roles: a predictor, a generalist examiner (checking non-cardiac causes), and a specialist reviewer (refining cardiac hypotheses)
  • Integrates a 'reference verification' step that explicitly retrieves and validates supporting evidence from guidelines to ground explanations
  • Dynamic tool use that combines visual analysis (ECG/Echo) with retrieval from curated cardiology knowledge bases
Evaluation Highlights
  • +36% improvement in top-3 diagnostic accuracy over Chain-of-Thought baselines on the MIMIC dataset
  • Clinicians assisted by HeartAgent surpassed unaided experts by 26.9% in diagnostic accuracy on random MIMIC cases
  • Achieved 92% precision in retrieving correct supporting references for diagnostic explanations
Breakthrough Assessment
8/10
Strong clinical utility demonstration with human-in-the-loop evaluation showing significant gains. The integration of self-verification and explicit reference retrieval addresses key trust issues in medical AI.
×