← Back to Paper List

Memoria: A Scalable Agentic Memory Framework for Personalized Conversational AI

Samarth Sarin, Lovepreet Singh, Bhaskarjit Sarmah, Dhagash Mehta
arXiv (2025)
Memory Agent KG P13N

📝 Paper Summary

Agentic Memory Personalization
Memoria augments LLMs with persistent memory by combining dynamic session summarization and a recency-weighted knowledge graph to maintain user context and persona across distinct conversations without fine-tuning.
Core Problem
Standard LLM chat systems are stateless, treating each interaction in isolation and failing to retain user preferences or context across different sessions.
Why it matters:
  • Lack of memory forces users to repeatedly introduce themselves or restate preferences, increasing interaction friction
  • Current vector-only memory systems lack interpretability and conflict resolution, while graph-based systems often struggle with scalability and recency
  • Business applications like financial advisory or customer support require maintaining long-term continuity to build user trust and personalization
Concrete Example: If a user previously identified as an equity trader in a past session, a standard stateless agent will forget this context in a new session. Memoria retains this via a Knowledge Graph, allowing the agent to personalize future financial updates immediately without re-prompting.
Key Novelty
Hybrid Memory Architecture with Recency-Weighted Knowledge Graph
  • Combines two memory types: SQL-based structured logs for raw history/summaries (short-term) and a dynamic Knowledge Graph for user persona (long-term)
  • Introduces a 'weighted' KG engine that applies exponential decay to memory triplets, prioritizing recent interactions to resolve conflicts in user preferences over time
Architecture
Architecture Figure Figure 1
Operational scenarios for Memoria: differentiating between New Users (no context) and Repeat Users (context retrieval).
Breakthrough Assessment
7/10
A strong architectural framework addressing the specific gap of combining short-term coherence (summaries) with long-term personalization (KG) in a modular, model-agnostic way.
×