← Back to Paper List

Memory in the Age of AI Agents: A Survey Forms, Functions and Dynamics

Yuyang Hu, Shichun Liu, Yanwei Yue, Guibin Zhang, Boyang Liu, Fangyi Zhu, Jiahang Lin, Honglin Guo, Shihan Dou, Zhiheng Xi, Senjie Jin, Jiejun Tan, Yanbin Yin, Jiongnan Liu, Zeyu Zhang, Zhongxiang Sun, Yutao Zhu, Hao Sun, Boci Peng, Zhenrong Cheng, Xuanbo Fan, Jiaxin Guo, Xinlei Yu, Zhenhong Zhou, Zewen Hu, Jiahao Huo, Junhao Wang, Yuwei Niu, Yu Wang, Zhenfei Yin, et al.
National University of Singapore, Renmin University of China, Fudan University, Peking University, Nanyang Technological University, Tongji University, University of California San Diego, Hong Kong University of Science and Technology (Guangzhou), Griffith University, Georgia Institute of Technology, OPPO, Oxford University
arXiv (2026)
Memory Agent Benchmark RAG RL MM

📝 Paper Summary

Memory organization Memory recall Agent evolution
This survey unifies fragmented research on agent memory into a coherent taxonomy defined by Forms (token/parametric/latent), Functions (factual/experiential/working), and Dynamics (formation/evolution/retrieval).
Core Problem
Research on agent memory is fragmented with inconsistent terminology, where concepts like 'LLM memory', 'RAG', and 'agent memory' are conflated, hindering systematic development.
Why it matters:
  • Lack of standardized definitions makes it difficult to compare mechanisms like short-term memory versus context window engineering
  • Existing taxonomies fail to capture emerging 2025 trends such as memory-augmented test-time scaling or reusable tool distillation
  • Without clear conceptual boundaries, researchers cannot effectively distinguish between static knowledge retrieval (RAG) and self-evolving agentic experience
Concrete Example: Early systems like MemoryBank and MemGPT were framed as 'LLM memory' solutions for extending context, but they are functionally 'agent memory' because they enable decision-making entities to track user preferences and accumulate experience across multi-turn interactions, unlike purely architectural context extensions.
Key Novelty
Unified Forms-Functions-Dynamics Taxonomy
  • Distinguishes Agent Memory from RAG and Context Engineering: Agent memory focuses on persistent, self-evolving cognitive states rather than just static retrieval or resource management
  • Classifies memory by Form: Token-level (discrete units like text), Parametric (weights), and Latent (hidden states)
  • Classifies memory by Function: Factual (world knowledge), Experiential (procedural skills/cases), and Working (current task workspace)
Evaluation Highlights
  • Compiles a list of key benchmarks including LoCoMo, LongMemEval, GAIA, and SWE-bench Verified for evaluating long-horizon agent capabilities
  • Identifies 3 distinct memory forms (Token, Parametric, Latent) and 3 functional types (Factual, Experiential, Working) to categorize existing literature
Breakthrough Assessment
9/10
Provides a highly necessary, rigorous conceptual framework that cleans up a confused field. It successfully disentangles agent memory from RAG and context engineering, setting a standard for future research.
×