| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Comparative analysis against baselines showing ToolRec's superiority, particularly in semantic-rich domains. | ||||
| MovieLens-1M | HR@10 | 0.1982 | 0.2246 | +0.0264 |
| MovieLens-1M | NDCG@10 | 0.1305 | 0.1583 | +0.0278 |
| Amazon-Beauty | HR@10 | 0.0601 | 0.0689 | +0.0088 |
| Amazon-Sports | NDCG@10 | 0.0235 | 0.0267 | +0.0032 |
| Ablation studies validating the necessity of both retrieval and ranking tools. | ||||
| MovieLens-1M | HR@10 | 0.1472 | 0.2246 | +0.0774 |
| MovieLens-1M | HR@10 | 0.1654 | 0.2246 | +0.0592 |