| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Image recognition results showing massive gains on OOD benchmarks. | ||||
| ImageNet-S | Top-1 Accuracy | 44.73 | 97.22 | +52.49 |
| ImageNet | Top-1 Accuracy | 56.00 | 98.31 | +42.31 |
| ImageNet (Mean across 8 datasets) | Top-1 Accuracy | 93.37 | 95.71 | +2.34 |
| VQA results demonstrating improvements on reasoning tasks. | ||||
| AI2D | Accuracy | 39.68 | 67.75 | +28.07 |
| MathVista | Accuracy | 65.49 | 66.94 | +1.45 |
| AI2D | Accuracy | 51.55 | 61.09 | +9.54 |