| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Main comparison on WeMusic-Bench showing WeMusic-Agent's superiority over baselines in recommendation accuracy. | ||||
| WeMusic-Bench | SR@10 | 17.06 | 47.62 | +30.56 |
| WeMusic-Bench | NDCG@10 | 10.15 | 24.16 | +14.01 |
| Efficiency analysis demonstrating reduced tool dependency. | ||||
| WeMusic-Bench | Tool Call Rate | 100.0 | 69.1 | -30.9 |
| WeMusic-Bench | Inference Time (s/sample) | 4.55 | 2.89 | -1.66 |