← Back to Paper List

Memory Bear AI A Breakthrough from Memory to Cognition Toward Artificial General Intelligence

Deliang Wen, Ke Sun
arXiv (2025)
Memory P13N KG Factuality

πŸ“ Paper Summary

Graph-based memory Layered memory Cognitive Architecture
Memory Bear integrates cognitive science principles, specifically ACT-R and the Ebbinghaus forgetting curve, into a three-layer LLM memory architecture to enable human-like active forgetting, emotional weighting, and self-reflective memory consolidation.
Core Problem
LLMs suffer from limited context windows, 'context drift' where they lose the original topic over time, and ineffective retrieval caused by token redundancy, leading to hallucinations and high costs.
Why it matters:
  • In healthcare, chronic disease management requires tracking patient history over years, which current context windows cannot support
  • Enterprise assistants need to retrieve decision histories from months prior without being distracted by irrelevant chit-chat
  • Current memory mechanisms lack 'active forgetting,' leading to bloated prompts that increase costs and distract the model's attention mechanism
Concrete Example: In a multi-turn dialogue, if a user asks about a product's warranty three times and confirms each answer with 'Okay', traditional approaches concatenate all repeated Q&A pairs and confirmations into the context. This redundancy wastes tokens and dilutes attention. Memory Bear identifies the semantic repetition and merges these into a single fact node, discarding the 'Okay' confirmations.
Key Novelty
Cognitively-Grounded Memory Orchestration
  • Implements a 'sleep' mechanism (Self-Reflection Engine) that periodically reorganizes memory graphs and resolves conflicts offline, similar to human memory consolidation
  • Applies an 'active forgetting' mechanism based on the Ebbinghaus curve, where memory nodes decay in activation value over time unless reinforced by retrieval or emotional weight
  • Distinguishes between 'Explicit Memory' (declarative graph) and 'Implicit Memory' (procedural patterns/habits), independent of LLM parameters
Evaluation Highlights
  • Reduces token usage during inference by ~90% compared to full-context inputs via intelligent semantic pruning
  • Improves inference accuracy by 15% by removing redundant tokens that cause attention dispersion
  • Decreases off-topic responses (context drift) by 70% in long-dialogue scenarios
Breakthrough Assessment
7/10
Strong conceptual innovation by engineering specific cognitive theories (ACT-R, Ebbinghaus) into a practical system architecture. Claims impressive efficiency gains (>90% token reduction), though the text is an architectural proposal with narrative results rather than a full benchmark suite.
×