| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| ThinkMorph demonstrates massive improvements on vision-centric tasks where the base model fails significantly. | ||||
| Spatial Navigation (VSP) | Accuracy | 0.83 | 86.67 | +85.84 |
| SAT (Spatial) | Accuracy | 49.33 | 52.67 | +3.34 |
| MMVP | Accuracy | 70.33 | 80.33 | +10.00 |
| BLINK-J | Accuracy | 65.33 | 73.33 | +8.00 |
| MMVP (Switched Subset) | Accuracy | 73.96 | 81.25 | +7.29 |