← Back to Paper List

Cross-Domain Recommendation Meets Large Language Models

Ajay Krishna Vajjala, Dipak Meher, Ziwei Zhu, David S. Rosenblum
arXiv (2024)
Recommendation P13N Reasoning

📝 Paper Summary

Cross-Domain Recommendation (CDR) Cold-Start Recommendation
The paper investigates the reasoning capabilities of LLMs for cross-domain recommendation by designing specific prompts that outperform specialized neural baselines in rating and ranking tasks, particularly using only source domain data.
Core Problem
Single-domain recommender systems fail on cold-start users, and existing cross-domain solutions rely on complex architectures that struggle with data scarcity and lack general reasoning capabilities.
Why it matters:
  • Existing CDR models often memorize patterns rather than reasoning, limiting generalization beyond training data
  • Data-scarce scenarios make it difficult to train complex neural networks effectively
  • Current systems cannot easily incorporate rich textual/contextual information which LLMs handle naturally
Concrete Example: A user has rated many Books (source) but no Movies (target). A traditional CDR model needs complex mapping training to predict movie ratings. An LLM can simply read the book history and infer movie preferences via natural language reasoning without task-specific training.
Key Novelty
LLM-based Cross-Domain Prompting Framework
  • Replaces complex neural mapping architectures with natural language prompts that feed user interaction history directly to an LLM
  • Introduces two prompting strategies: one simulating warm-start (source + target history) and one simulating cold-start (source history only)
  • Demonstrates that LLMs can infer target domain preferences solely from source domain behavior, sometimes performing better without target domain noise
Evaluation Highlights
  • GPT-4 achieves MRR of 0.308 on Books->Movies ranking, outperforming the best baseline (PTUPCDR) which scored 0.2596
  • GPT-4o achieves NDCG of 0.412 on Movies->Music ranking, surpassing the best baseline (PTUPCDR) score of 0.3822
  • For rating prediction, llama-3-8b-instruct performs better when using *only* source domain data compared to combining source and target data
Breakthrough Assessment
7/10
Strong empirical results showing LLMs beat specialized baselines without training. The counter-intuitive finding that source-only data sometimes yields better rating predictions than source+target is a notable insight for the field.
×