← Back to Paper List

EasyRec: Simple yet Effective Language Models for Recommendation

X Ren, C Huang
The University of Hong Kong
arXiv, 8/2024 (2024)
Recommendation P13N

πŸ“ Paper Summary

Recommender Systems Language Models for Recommendation Zero-Shot Learning
EasyRec enhances recommender systems by aligning text-based user/item profiles generated by LLMs with collaborative signals through a contrastive learning framework, enabling effective zero-shot performance.
Core Problem
Existing deep collaborative filtering methods rely heavily on unique IDs, struggling with data sparsity and zero-shot scenarios where user/item IDs are unseen.
Why it matters:
  • ID-based models cannot generalize to new domains or time periods where specific user/item tokens differ.
  • Current LLM-based recommenders are either inefficient (slow inference) or fail to effectively capture high-order collaborative signals alongside semantic information.
Concrete Example: A user interested in 'AI development' might be recommended 'Sci-Fi novels' by a purely semantic model because both contain 'AI', whereas a collaborative model would know AI researchers typically buy technical books, not fiction. EasyRec aims to bridge this semantic-collaborative gap.
Key Novelty
Text-Behavior Alignment via Collaborative Language Modeling
  • Generates rich textual profiles for users and items using LLMs (e.g., LLaMA) that incorporate interaction history (reviews, titles) to reflect collaborative signals in text.
  • Fine-tunes a lightweight bidirectional Transformer using contrastive learning to align these semantic text embeddings with collaborative behavior patterns.
  • Employes an LLM-based profile diversification strategy (rephrasing profiles) to augment data and improve generalization.
Evaluation Highlights
  • Outperforms state-of-the-art baselines like BM25 and BERT by significant margins in zero-shot scenarios (e.g., +13.5% over Universal-U on Amazon-Beauty).
  • Achieves high efficiency with ~0.01 seconds per prediction, compared to ~1.0 second for generative LLM-based recommenders.
  • Demonstrates scaling law behavior where performance improves consistently as the underlying language model size increases (from 100M to 400M parameters).
Breakthrough Assessment
7/10
Strong practical contribution. Effectively bridges the gap between ID-based CF and semantic LLMs with a lightweight, efficient architecture. The zero-shot performance is impressive, though the core components (contrastive learning, BERT encoders) are established techniques applied novelly.
×