← Back to Paper List

Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models

Chengkai Huang, Junda Wu, Yu Xia, Zixu Yu, Ruhan Wang, Tong Yu, Ruiyi Zhang, Ryan A. Rossi, B. Kveton, Dongruo Zhou, Julian J. McAuley, Lina Yao
University of New South Wales, University of California San Diego, Indiana University, Adobe Research
arXiv.org (2025)
Recommendation Agent MM Memory P13N

📝 Paper Summary

Agentic AI Recommender Systems (RS)
This perspective paper formalizes LLM-based Agentic Recommender Systems (LLM-ARS), proposing a unified architecture integrating profiling, planning, memory, and action to transition RS from static ranking to autonomous, proactive assistants.
Core Problem
Traditional Recommender Systems are reactive, relying on static ID-based features and implicit feedback, which limits their ability to handle open-ended goals, plan complex tasks, or proactively adapt to evolving user intents.
Why it matters:
  • Current systems conflate transient actions with enduring preferences due to reliance on implicit feedback (e.g., clicks), lacking transparency in why a recommendation was made.
  • Existing RS cannot effectively integrate open-domain knowledge or multimodal signals, limiting performance in complex, cross-platform scenarios.
  • The static, one-directional nature of traditional RS prevents users from iteratively refining suggestions through natural language, failing to align with human decision-making processes.
Concrete Example: In a traditional RS, a user searching for 'dinner' gets a static list of restaurants based on click history. In an Agentic RS, the system acts as a concierge: it autonomously plans a full evening by booking a restaurant, selecting a movie that fits the time slot, and arranging transport, iteratively refining the plan based on the user's real-time feedback.
Key Novelty
Formal Framework for Agentic Recommender Systems (LLM-ARS)
  • Proposes a four-level evolutionary taxonomy for RS: from Static (Level 1) and Intelligent (Level 2) to Agentic (Level 3), distinguishing reactive systems from autonomous ones.
  • Defines a unified modular architecture comprising four key components: User Profiling (dynamic state tracking), Memory (long/short-term storage), Planning (reasoning/strategy), and Action (tool use/execution).
Breakthrough Assessment
7/10
While a perspective paper without new experimental results, it provides a crucial formalization and taxonomy for the emerging field of Agentic RS, unifying disparate existing works into a coherent framework.
×