| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance on ToolBench (Average of I2 and I3 subsets). EASYTOOL improves Success Rate across most models. | ||||
| ToolBench | Success Rate | 64.3 | 72.8 | +8.5 |
| ToolBench | Success Rate | 62.3 | 69.8 | +7.5 |
| ToolBench | Success Rate | 61.0 | 70.5 | +9.5 |
| ToolBench (I2 + I3) | NDCG@5 | 38.8 | 85.6 | +46.8 |
| RestBench (TMDB) | Correct Path Rate | 45.0 | 65.0 | +20.0 |