| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Experiments on the 2D Toy Model demonstrate the clear progression from memorization to spurious states to generalization as training data size increases. | ||||
| 2D Toy Model | Energy Landscape Topology | 2 minima (Memorization) | Emergent minima between points (Spurious) | Topology change |
| Distance histograms on high-dimensional data (CIFAR-10/CelebA) reveal bimodal distributions used to classify spurious states. | ||||
| CIFAR-10 / CelebA | Nearest Neighbor Distance Histograms | N/A | Bimodal distribution | Existence of distinct groups |