← Back to Paper List

ReLLa: Retreival-enhanced LLMs for Lifelong sequential behavior comprehension in recommendation

Shagun Sodhani, Mojtaba Faramarzi, Sanket Vaibhav Mehta, Pranshu Malviya, Mohamed Abdelsalam, Janarthanan Janarthanan, Sarath Chandar
Fundamental AI Research, Meta AI, Universite de Montreal, Carnegie Mellon University, Polytechnique Montréal, Quebec Artificial Intelligence Institute
WWW (2024)
Memory Benchmark QA

📝 Paper Summary

Catastrophic Forgetting Knowledge Transfer Continual Learning Strategies
This primer provides a comprehensive overview of lifelong supervised learning, categorizing approaches into regularization, memory, and architecture-based strategies while defining key scenarios and metrics.
Core Problem
Standard machine learning models suffer from catastrophic forgetting when trained on sequential tasks and lack the ability to transfer knowledge effectively between tasks (forward and backward transfer).
Why it matters:
  • Current AI systems are data-hungry and computationally expensive because they cannot incrementally accumulate knowledge like humans do
  • Retraining models from scratch for every new task is inefficient and hinders adaptation to open-ended environments
  • Existing paradigms like Transfer Learning typically only focus on forward transfer (improving current task) rather than maintaining performance on all previous tasks
Concrete Example: Imagine a system that has to learn the alphabet every time it reads a book. Because it cannot transfer knowledge across the tasks of learning alphabets and reading books, it has poor sample complexity. In contrast, humans incrementally acquire knowledge without forgetting.
Key Novelty
Unified Taxonomy of Lifelong Learning
  • Categorizes learning strategies into three distinct families: Regularization (constraining parameter changes), Memory (replaying past data), and Architecture (isolating/expanding parameters)
  • Formalizes the distinction between Domain-Incremental (unknown task ID, changing input dist), Task-Incremental (known task ID), and Class-Incremental (unknown task ID, changing output space) scenarios
  • Consolidates evaluation metrics beyond accuracy, specifically defining Forgetting Measure (backward loss) and Intransigence (inability to learn new tasks)
Breakthrough Assessment
8/10
A foundational primer that organizes a fragmented field. While it is a survey/intro rather than a new method, its structural taxonomy and rigorous definition of scenarios are essential for researchers.
×