← Back to Paper List

RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation

S Zhou, W Gan, Q Liu, K Lei, J Zhu, H Huang, Y Xia…
Zhejiang University, Huawei Noah’s Ark Lab, Shanghai AI Lab, The Hong Kong Polytechnic University
arXiv, 9/2025 (2025)
Recommendation Pretraining P13N

📝 Paper Summary

LLM-based recommendation Generative recommendation Zero-shot recommendation
RecBase is a foundation model pretrained from scratch on cross-domain recommendation data using a unified hierarchical item tokenizer to enable effective zero-shot transfer.
Core Problem
Existing LLM-based recommenders rely on language-centric pretraining that struggles to capture item-level sequential patterns, while ID-based models fail to generalize across domains due to disjoint vocabularies.
Why it matters:
  • The knowledge gap between language modeling and recommendation tasks limits the ability of standard LLMs to model item co-relationships effectively.
  • Traditional ID-based recommenders cannot handle zero-shot scenarios or new domains because item IDs are not transferable.
  • Mapping recommendation data directly to natural language is often verbose and may not effectively represent user behavioral sequences.
Concrete Example: When a standard LLM is asked to predict the next item for a user who bought a specific sequence of products, it often hallucinates or suggests generally popular items rather than personalized ones because it lacks specific collaborative signal knowledge. RecBase, by pretraining on 35M interactions across 15 domains, learns these specific sequential patterns.
Key Novelty
Curriculum Learning Enhanced RQ-VAE (CL-VAE) for Unified Item Tokenization
  • Standardizes item representations across domains by converting textual descriptions into hierarchical, discrete concept IDs using a shared encoder.
  • Uses curriculum learning to progressively train the quantization codebooks from coarse to fine, preventing codebook collapse and ensuring better utilization of the token space.
  • Pretrains an autoregressive Transformer on these discrete concept ID sequences across diverse domains to learn universal recommendation patterns.
Architecture
Architecture Figure Figure 1
Overview of the RecBase framework, illustrating the two-stage process: (1) Item Tokenization via CL-VAE and (2) Autoregressive Pretraining.
Evaluation Highlights
  • RecBase-1.5B outperforms Llama-3-8B and Qwen-2-7B on zero-shot ranking tasks across 8 unseen datasets (e.g., +4.7% AUC on H&M, +2.4% AUC on Steam).
  • RecBase-0.3B (313M parameters) surpasses larger language models like OPT-1.3B and BERT-base in zero-shot performance while being significantly more efficient.
  • Fine-tuning yields further gains: +17.2% AUC improvement on Steam and +8.6% on MovieLens compared to zero-shot performance.
Breakthrough Assessment
8/10
Strong contribution in creating a true 'foundation model' for recommendation (trained from scratch on rec data) rather than just adapting an LLM. The unified tokenizer is a significant technical enabler for cross-domain transfer.
×