ResT achieves state-of-the-art results on the Berkeley Function Calling Leaderboard (BFCL), outperforming baselines and even larger proprietary models.

| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| BFCL | Overall Accuracy | 78.43 | 87.19 | +8.76 |
| BFCL | Single-turn Tool Use | 86.08 | 90.19 | +4.11 |
| BFCL | Multi-turn Base | 90.00 | 91.50 | +1.50 |

Ablation studies confirm the effectiveness of the dynamic curriculum strategy.

| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| BFCL | Overall Accuracy | 82.33 | 87.19 | +4.86 |