← Back to Paper List

pFedSim: Similarity-Aware Model Aggregation Towards Personalized Federated Learning

Jia-Jia Tan, Yipeng Zhou, Gang Liu, Jessie Hui Wang, Shui Yu
Shenzhen University, P.R. China, Macquarie University, Australia, Tsinghua University, P.R. China, University of Technology Sydney, Australia
arXiv.org (2023)
P13N

📝 Paper Summary

Personalized Federated Learning (pFL) Non-IID Data Handling
pFedSim improves personalized Federated Learning by decoupling models into feature extractors and classifiers, using classifier distances to identify similar clients for aggregation without exposing external data.
Core Problem
In Federated Learning, non-IID data distributions across clients cause single global models to perform poorly, but existing personalization methods often require exposing sensitive metadata or incur high communication costs.
Why it matters:
  • Data heterogeneity (non-IID) is a fundamental challenge in FL, potentially causing model divergence or severe performance drops on local data.
  • Existing similarity-based pFL methods often risk privacy by requiring sharing of label distributions or auxiliary data representations.
  • Model decoupling approaches typically train feature extractors globally without personalization, missing opportunities for finer-grained adaptation.
Concrete Example: Consider two clients: one with images of dogs and cats (labels 0-1), another with cars and trucks (labels 8-9). A standard FL model averages their weights, degrading performance for both. pFedSim detects they are dissimilar via their classifiers and aggregates their feature extractors only with other similar clients (e.g., other animal-image holders).
Key Novelty
Similarity-Aware Model Decoupling
  • Decouples neural networks into a 'feature extractor' and a 'classifier'; the classifier is kept local to capture personalization.
  • Uses the distance between local classifiers as a proxy for data similarity, enabling the server to aggregate feature extractors only from clients with similar data distributions.
  • Operates in two phases: a 'Generalization' warm-up phase (standard FedAvg) followed by a 'Personalization' phase where aggregation weights are adjusted based on the classifier similarity matrix.
Evaluation Highlights
  • Achieves highest accuracy across CIFAR-10, CINIC-10, Tiny-ImageNet, and EMNIST compared to 11 baselines.
  • Improves model accuracy by up to ~10% on Tiny-ImageNet (Dirichlet 0.1) compared to FedAvg.
  • Outperforms state-of-the-art pFL method FedAP by ~22% on Tiny-ImageNet (Dirichlet 0.1) without requiring public data or sharing batch-norm statistics.
Breakthrough Assessment
7/10
Strong empirical results and a privacy-friendly design for similarity estimation. While the components (decoupling, similarity aggregation) exist, the specific combination using classifier weights as a proxy is effective and practical.
×