← Back to Paper List

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Jinyuan Fang, Yanwen Peng, Xi Zhang, Yingxu Wang, Xinhao Yi, Guibin Zhang, Yi Xu, Bin Wu, Siwei Liu, Zihao Li, Zhaochun Ren, Nikos Aletras, Xi Wang, Han Zhou, Zaiqiao Meng
University of Glasgow, University of Sheffield, Mohamed bin Zayed University of Artificial Intelligence, National University of Singapore, University of Cambridge, University College London, University of Aberdeen, Leiden University
arXiv (2025)
Agent Memory RL Reasoning Benchmark

📝 Paper Summary

Self-evolving Agentic reasoning Evolving foundational agentic capabilities Multi-Agent Self-Evolving (MASE)
This survey establishes a unified framework for self-evolving AI agents, categorizing techniques that enable agents to autonomously refine their components and interaction structures through continuous environmental feedback.
Core Problem
Most existing agent systems rely on static, manually crafted configurations (prompts, tools, workflows) that fail to adapt to dynamic environments or changing task requirements after deployment.
Why it matters:
  • Manual reconfiguration of agent systems is time-consuming, labor-intensive, and difficult to scale as user intents and external tools shift
  • Static agents struggle with lifelong learning, unable to incorporate new experiences or optimize their own architectures without human intervention
  • Current paradigms like Model Online Adaptation focus on model weights but miss the structural evolution of agentic workflows and tool use
Concrete Example: An agent assisting in customer service may encounter a newly launched product. A static agent fails because its knowledge base and response templates are fixed. A self-evolving agent would autonomously detect the failure, update its memory or toolset to include the new product data, and refine its response strategy based on user feedback.
Key Novelty
Unified Conceptual Framework for Multi-Agent Self-Evolving (MASE)
  • Proposes the 'Three Laws of Self-Evolving AI Agents': Endure (safety), Excel (performance), and Evolve (autonomous optimization)
  • Formalizes the evolution loop as four components: System Inputs, Agent System (prompts, memory, tools, topology), Environment (feedback), and Optimizers (search algorithms)
  • Categorizes evolution into single-agent component optimization (prompt, memory, tool) and multi-agent structural optimization (topology, communication)
Evaluation Highlights
  • Surveys over 50 specific self-evolving techniques (e.g., OPRO, Reflexion, GPTSwarm) across varying agent components
  • Constructs a taxonomy distinguishing between static paradigms (Model Offline Pretraining) and dynamic paradigms (Multi-Agent Self-Evolving)
  • Identifies critical open challenges in safety, evaluation, and catastrophic forgetting for lifelong agentic systems
Breakthrough Assessment
9/10
This is a foundational survey that defines a new sub-field. It organizes scattered research into a coherent framework, offering necessary definitions and taxonomies for future work in autonomous agent evolution.
×