Evaluation Setup
Simulated FL with incomplete classes using Dirichlet distribution to partition data.
Benchmarks:
- CIFAR-10 (Image Classification)
- CIFAR-100 (Image Classification)
- CINIC-10 (Image Classification)
Metrics:
- Global Aggregation Accuracy (test accuracy of global model on all classes)
- Personalization Accuracy (average test accuracy of local models on local observed classes)
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Global Aggregation Accuracy comparisons on CIFAR-10 show MAP outperforming all baselines. |
| CIFAR-10 |
Aggregation Accuracy |
78.25 |
89.91 |
+11.66
|
| CIFAR-10 |
Aggregation Accuracy |
86.32 |
89.91 |
+3.59
|
| Personalization Accuracy comparisons on CIFAR-10 show MAP achieving superior local performance. |
| CIFAR-10 |
Personalization Accuracy |
81.65 |
96.32 |
+14.67
|
| CIFAR-10 |
Personalization Accuracy |
94.38 |
96.32 |
+1.94
|
| Ablation studies demonstrate the contribution of RS and HPM components individually. |
| CIFAR-10 |
Aggregation Accuracy |
81.98 |
89.91 |
+7.93
|
| CIFAR-10 |
Personalization Accuracy |
93.45 |
96.32 |
+2.87
|
Main Takeaways
- MAP significantly outperforms FedAvg and even specialized baselines like FedROD in scenarios with high missing class rates (e.g., 0.5).
- Restricted Softmax (RS) is the primary driver for aggregation performance, preventing the 'weight divergence' caused by missing classes.
- Inherited Private Models (HPM) are key for personalization, allowing clients to retain local knowledge that might be overwritten by global updates.
- The method is robust to different levels of data heterogeneity (Dirichlet alpha) and missing class ratios.