← Back to Paper List

DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation

Bowen Zheng, Xiaolei Wang, Enze Liu, Xi Wang, Lu Hongyu, Yu Chen, Wayne Xin Zhao, Ji-Rong Wen
Renmin University of China, Beijing Institute of Technology, WeChat, Tencent
arXiv (2025)
Recommendation Agent RL Reasoning

📝 Paper Summary

Sequential Recommendation LLM Agents for Recommendation
DeepRec enables LLMs to autonomously interact with traditional recommendation models via multi-turn reasoning and retrieval to deeply explore the item space before ranking.
Core Problem
Existing LLM-based recommenders perform 'shallow' exploration, either merely enhancing traditional models with features or conducting a single retrieval-then-rank step without iterative reasoning.
Why it matters:
  • Sequential recommendation requires complex reasoning over evolving user preferences that static traditional models cannot capture
  • Current LLM approaches fail to leverage the generative reasoning capable of refining search queries iteratively (like a human researcher)
  • Fine-tuning LLMs as standalone recommenders is computationally expensive and struggles to keep up with dynamic item pools
Concrete Example: A traditional model might retrieve items based solely on past clicks. An LLM-enhanced model might just re-rank those fixed candidates. DeepRec, however, acts like an agent: it generates a thought about the user's history, queries the traditional model with a specific preference description, analyzes the returned items, and decides whether to search again or finalize the list.
Key Novelty
Autonomous Reasoning-Retrieval Paradigm
  • Treats the Traditional Recommendation Model (TRM) as a 'tool' that an LLM agent can invoke multiple times via generated text commands
  • Uses a 'Preference-Aware' TRM that fuses user history embeddings with the LLM's generated text preference to retrieve more semantically relevant candidates
  • Employes a two-stage Reinforcement Learning strategy to first teach the LLM how to interact with the tool (Cold-Start) and then optimize for recommendation accuracy (Recommendation-Oriented)
Architecture
Architecture Figure Figure 2
The overall architecture of DeepRec, illustrating the autonomous multi-turn reasoning-retrieval process.
Breakthrough Assessment
7/10
Novel application of agentic patterns (Deep Research) to recommendation. The multi-turn retrieval with a modified TRM tool is a logical but significant step forward from simple re-ranking.
×