Evaluation Setup
Image Classification on standard benchmarks
Benchmarks:
- CIFAR-10 (Image Classification)
- CIFAR-100 (Image Classification)
- Tiny-ImageNet (Image Classification)
- ImageNet (Image Classification)
Metrics:
- Top-1 Error Rate (%)
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Comparative results on CIFAR-100 showing XSAM consistently achieving lower error rates than SGD and SAM across multiple architectures. |
| CIFAR-100 |
Top-1 Error Rate |
17.05 |
16.50 |
-0.55
|
| CIFAR-100 |
Top-1 Error Rate |
26.85 |
26.35 |
-0.50
|
| CIFAR-10 |
Top-1 Error Rate |
Not reported in the paper |
Not reported in the paper |
Not reported in the paper
|
| CIFAR-100 |
Top-1 Error Rate |
16.55 |
16.50 |
-0.05
|
| CIFAR-100 |
Top-1 Error Rate |
Not reported in the paper |
Not reported in the paper |
Not reported in the paper
|
Main Takeaways
- XSAM consistently outperforms SAM and its variants across various architectures (ResNet, VGG, DenseNet) and datasets.
- The approximation quality of the update direction in standard SAM degrades as the number of ascent steps increases; XSAM mitigates this.
- Visualizations confirm that while the single-step ascent gradient is better than the local gradient, it is still inaccurate; explicit estimation corrects this.