Evaluation Setup
Federated Learning simulation with K=100 clients under severe label skew (Dirichlet partition).
Benchmarks:
- MNIST (Image Classification)
- FMNIST (Fashion-MNIST) (Image Classification)
Metrics:
- Test Accuracy
- Communication Rounds
- Total Communication Overhead
- Statistical methodology: Results averaged over five random seeds.
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| FedLECC achieves superior test accuracy compared to state-of-the-art baselines under severe label skew settings. |
| FMNIST |
Test Accuracy |
73.2 |
82.0 |
+8.8
|
| FMNIST |
Communication Rounds reduction |
150 |
117 |
-33
|
| FMNIST |
Total Communication Overhead reduction |
100 |
50 |
-50
|
Main Takeaways
- FedLECC consistently converges faster and reaches higher accuracy than uniform sampling (FedAvg) and single-factor selection (POC) under severe non-IID data.
- The combination of clustering (diversity) and loss-guidance (informativeness) effectively mitigates client drift caused by label skew.
- Significant reduction in communication overhead makes it suitable for bandwidth-constrained cloud-edge environments.
- Outperforms purely regularization-based methods (FedProx, FedDyn) in the high non-IID regime by ensuring better data representation in each round.