← Back to Paper List

Data-efficient Fine-tuning for LLM-based Recommendation

X Lin, W Wang, Y Li, S Yang, F Feng, Y Wei, TS Chua
City University of Hong Kong, Zhejiang University
arXiv, 1/2024 (2024)
Recommendation P13N

📝 Paper Summary

LLM-based Recommendation Data Pruning / Coreset Selection
DEALRec identifies a small, influential subset of data for fine-tuning recommender LLMs by combining an efficient influence estimation from a surrogate model with an effort score measuring LLM-specific difficulty.
Core Problem
Fine-tuning LLMs on massive, rapidly updating recommendation data is computationally prohibitive, while random few-shot sampling misses crucial representative patterns.
Why it matters:
  • Recommendation data grows explosively (e.g., TikTok has ~942B interactions/day), requiring frequent updates that are too costly for full LLM fine-tuning
  • Existing coreset selection methods (heuristic or optimization-based) rely on training the full model first, which defeats the purpose of efficiency for LLMs
Concrete Example: A randomly sampled few-shot dataset might miss trending items or specific user behaviors. Conversely, using a traditional surrogate model to pick data might select samples that are hard for the surrogate but trivial for a pre-trained LLM, leading to suboptimal adaptation.
Key Novelty
Two-stage Influence-Effort Scoring (DEALRec)
  • Efficiently estimates the 'influence score' (impact of removing a sample on global loss) using a small surrogate model and a symmetric HVP approximation to avoid retraining
  • Introduces an 'effort score' (gradient norm of the LLM on the sample) to correct the surrogate's bias, prioritizing samples that are specifically difficult for the LLM to learn
Architecture
Architecture Figure Figure 2
The overall framework of DEALRec.
Evaluation Highlights
  • Surpasses full-data fine-tuning performance using only 2% of training samples on MovieLens-1M
  • Reduces time costs by 97% compared to full fine-tuning while maintaining competitive accuracy
  • Outperforms random sampling and heuristic coreset methods (e.g., Entropy, Geometry) across three datasets (MovieLens, Amazon Beauty, Amazon Games)
Breakthrough Assessment
8/10
Significantly improves practical viability of LLM recommenders by slashing compute costs (97% reduction) while maintaining or exceeding accuracy. The decoupling of influence (surrogate) and effort (LLM) is a clever engineering solution.
×