← Back to Paper List

Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning

Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang
University of Notre Dame, Amazon.com Inc.
Conference on Empirical Methods in Natural Language Processing (2024)
Memory P13N RAG Benchmark

📝 Paper Summary

Memory internalization User-profile based personalization RAG-based personalization
OPPU assigns each user a dedicated, lightweight PEFT module that stores personal behavior patterns, allowing efficient personalization while maintaining privacy and adapting to behavior shifts better than retrieval alone.
Core Problem
Existing LLM personalization relies on centralized models using retrieval (RAG) or prompt profiles, which suffer from privacy/ownership issues and fail when retrieved history is noisy or irrelevant to new behaviors.
Why it matters:
  • Centralized processing requires users to share sensitive data with service providers, raising privacy concerns
  • Retrieval-augmented generation (RAG) struggles with 'behavior shifts' where past history doesn't semantically match current queries, distracting the LLM with irrelevant context
  • Static prompts have limited context windows and cannot capture complex, dynamic user behavior patterns effectively
Concrete Example: In a citation identification task, a user's publication history (the retrieval corpus) might be topically different from the specific paper they are currently citing. A standard retriever fetches irrelevant past papers, confusing the model. OPPU uses the user's learned parameters to identify the citation style/preference without relying solely on semantic similarity of past text.
Key Novelty
One PEFT Per User (OPPU)
  • Treats personalization as a plug-and-play modular problem: each user gets a tiny, private set of tunable parameters (like a LoRA adapter) that plugs into a frozen base LLM.
  • Integrates 'parametric knowledge' (learned user patterns in weights) with 'non-parametric knowledge' (retrieved history) for a hybrid approach that is robust even when retrieval fails.
Architecture
Architecture Figure Figure 1 (Implicit Description)
Comparison of centralized personalization (RAG/Prompting) vs. OPPU decentralized personalization.
Evaluation Highlights
  • +17.38% average relative improvement in MAE for personalized product rating prediction (LaMP-3) compared to baselines.
  • +11.87% accuracy improvement on personalized movie tagging (LaMP-2M) using OPPU compared to non-personalized baselines.
  • Achieves state-of-the-art results across all 7 tasks in the LaMP benchmark, consistently outperforming retrieval-augmented (RAG) and profile-augmented (PAG) methods.
Breakthrough Assessment
7/10
Strong empirical results across a standard benchmark (LaMP) and a practical architecture for privacy-preserving personalization. The concept of per-user LoRA is an evolutionary step in PEFT application rather than a fundamental theoretical shift.
×