Evaluation Setup
Simulated Federated Learning environment with malicious clients injecting backdoors
Benchmarks:
- Fashion-MNIST (Image Classification)
- CIFAR-10 (Image Classification)
- CIFAR-100 (Image Classification)
- N-BaIoT (IoT Attack Detection)
Metrics:
- Attack Success Rate (ASR)
- Main Task Accuracy (ACC)
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Attack performance against server-side defenses (Trimmed Mean) on Fashion-MNIST shows PFedBA's superiority over baselines. |
| Fashion-MNIST |
Attack Success Rate (ASR) |
40.5 |
58.0 |
+17.5
|
| Fashion-MNIST |
Attack Success Rate (ASR) |
0.0 |
58.0 |
+58.0
|
| Performance across different PFL algorithms without specific defenses. |
| Fashion-MNIST (FedAvg-FT) |
Attack Success Rate (ASR) |
70.0 |
99.0 |
+29.0
|
| Attack performance against client-side defense (Neural Cleanse). |
| CIFAR-10 |
Attack Success Rate (ASR) |
12.0 |
85.0 |
+73.0
|
Main Takeaways
- Personalization (fine-tuning) in PFL acts as a natural defense, reducing ASR of standard attacks (like Sybil or naive poisoning) significantly.
- PFedBA consistently outperforms all baselines (Sybil, PGD, Neurotoxin, CerP) across 10 different PFL algorithms (including partial and full model sharing).
- Gradient alignment effectively couples the backdoor task with the main task, making it resistant to 'catastrophic forgetting' during personalization.
- Even when combined with robust aggregation (e.g., Trimmed Mean) or client-side sanitization (Neural Cleanse), PFedBA retains significant attack effectiveness where others fail.