| Benchmark | Metric | Baseline | This Paper | Ξ |
|---|---|---|---|---|
| Main comparison results demonstrating ToolScope's improvement over baselines across different LLMs and datasets. | ||||
| Seal-Tools | CSR@10 | 58.42 | 93.00 | +34.58 |
| UltraTool | CSR@10 | 26.36 | 65.00 | +38.64 |
| BFCL | CSR@10 | 89.00 | 97.80 | +8.80 |
| Impact of Auto-Correction module on performance. | ||||
| UltraTool | CSR | 57.1 | 65.0 | +7.9 |
| Context length reduction analysis. | ||||
| Seal-Tools | Context Tokens | 292107 | 317 | -291790 |