| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| ML-Master outperforms all baselines on the MLE-Bench leaderboard in terms of overall medal rate. | ||||
| MLE-Bench | Average Medal Rate | 22.4% | 29.3% | +6.9% |
| MLE-Bench | Average Medal Rate | 13.6% | 29.3% | +15.7% |
| MLE-Bench | Average Medal Rate | 10.0% | 29.3% | +19.3% |
| Performance breakdown by difficulty shows ML-Master's specific strength in medium-difficulty tasks. | ||||
| MLE-Bench (Medium Tasks) | Medal Rate | 9.0% | 20.2% | +11.2% |