← Back to Paper List

Language Models Encode Collaborative Signals in Recommendation

L Sheng, A Zhang, Y Zhang, Y Chen, X Wang, TS Chua
Tsinghua University, Université de Montréal
arXiv, 7/2024 (2024)
Recommendation P13N

📝 Paper Summary

LLM-based recommendation Collaborative Filtering (CF) Representation Learning
AlphaRec demonstrates that simple linear projections of item text representations from frozen language models can outperform traditional ID-based collaborative filtering methods by uncovering implicit user preference signals.
Core Problem
Traditional recommender systems rely on ID-based embeddings that lack semantic richness and transferability, while it remains unclear if Language Models (LMs) implicitly encode user behavioral preferences within their language representations.
Why it matters:
  • Prevailing wisdom assumes distinct spaces for language and user behavior, requiring complex alignment strategies that may be unnecessary
  • ID-based models struggle with cold-start problems and transferring knowledge across different datasets due to the lack of shared semantics
  • Understanding if LMs inherently encode collaborative signals could simplify recommender architecture design significantly
Concrete Example: In traditional systems, two movies like 'Godzilla' and 'King Kong' might only be linked if many users watched both. The paper shows that an LM's language representations, even without explicit behavioral training, can be linearly mapped to cluster these items together based on latent user preferences (e.g., 'monster movies'), whereas semantic similarity alone might separate them if their descriptions differ.
Key Novelty
Homomorphism between Language and Behavior Spaces
  • Empirically proves that language representations of item titles can be linearly mapped to a behavior space that accurately predicts user interactions
  • Discovers that this mapping capability scales with model size (larger LMs yield better recommendations) and is robust to prompt noise
  • Proposes AlphaRec: a streamlined model using only frozen LM representations, a simple MLP projector, and contrastive loss, discarding ID embeddings entirely
Architecture
Architecture Figure Figure 1(a)
Conceptual framework of the Linear Mapping approach in AlphaRec.
Evaluation Highlights
  • AlphaRec with Llama2-7B outperforms the leading ID-based baseline LightGCN on the Amazon 'CDs and Vinyl' dataset (Recall@20: 0.1656 vs 0.1557)
  • Zero-shot performance: AlphaRec trained on 'Books' achieves comparable performance to a fully trained LightGCN on 'Movies and TV' without any fine-tuning on the target dataset
  • Linear mapping performance scales with model size: Llama2-70B consistently outperforms Llama2-7B and 13B across multiple datasets
Breakthrough Assessment
8/10
Challenges the fundamental assumption that language and behavior spaces are distinct, showing that frozen LMs can replace ID embeddings with superior performance and zero-shot transferability.
×