| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Results on Wiki2023+ (Unmodulated) showing performance degradation by answer position (1=start, 6=end) and improvement via D-AR. | ||||
| Wiki2023+ | EM (Position 1) | 40.9 | 60.1 | +19.2 |
| Wiki2023+ | EM (Position 6 / End) | 14.9 | 30.4 | +15.5 |
| Wiki2023+ | Average EM (Positions 1-6) | 15.7 | 31.0 | +15.3 |
| Wiki2023+ | Average EM | 15.7 | 24.3 | +8.6 |
| Wiki2023+ | EM (Position 1) | 65.3 | 70.8 | +5.5 |
| Wiki2023+ | EM (Position 6 / End) | 31.7 | 46.2 | +14.5 |