← Back to Paper List

Dual-Personalizing Adapter for Federated Foundation Models

Yiyuan Yang, Guodong Long, Taoshu Shen, Jing Jiang, Michael Blumenstein
Australian AI Institute, University of Technology Sydney
Neural Information Processing Systems (2024)
P13N Pretraining Benchmark

📝 Paper Summary

Federated Foundation Models Personalized Federated Learning (PFL) Parameter-Efficient Fine-Tuning (PEFT)
FedDPA employs a dual-adapter architecture with instance-wise dynamic weighting to balance local personalization with robustness to unseen test-time distribution shifts in federated foundation models.
Core Problem
Existing Federated Foundation Models align well with local training data but fail to handle test-time distribution shifts, where clients encounter new tasks or domains during inference that differ from their training distributions.
Why it matters:
  • Real-world client needs are dynamic; a user training on English emails may suddenly need Chinese translation, requiring the model to generalize beyond its specific personalization
  • Current PFL methods optimize solely for the local distribution, creating a trade-off where personalization degrades performance on novel test-time tasks (unseen in training)
Concrete Example: A client typically writes emails in English (training data) but later requires translation assistance for a new project in Chinese (test-time shift). A standard personalized model overfitted to English emails fails to provide the generic translation capability needed.
Key Novelty
Federated Dual-Personalizing Adapter (FedDPA)
  • Maintains two distinct adapters per client: a Global Adapter that learns generic knowledge from the federated aggregation, and a Local Adapter that focuses on client-specific personalization
  • Uses an instance-wise dynamic weighting mechanism during inference to autonomously determine the proportional contribution of each adapter for a given test instance
Architecture
Architecture Figure Text description (Figure 1 implied)
Conceptual framework of FedDPA showing the dual-adapter mechanism
Breakthrough Assessment
7/10
Novel formulation of 'test-time personalization' in FedFM. The dual-adapter approach addresses the specific conflict between personalization and generalization, though the core concept of mixing global/local modules exists in broader FL.
×