Scaling trends for rollout allocation (n) show saturation points that differ by problem difficulty.

| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Guru-Math (Easy) | Optimal n (rollouts) | Small n | 512 | Saturation point |
| Guru-Math (Hard) | Optimal n (rollouts) | 512 | Lower saturation | Lower |