| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| MAS4POI significantly outperforms traditional Deep Learning baselines on both datasets, demonstrating the effectiveness of the multi-agent approach. | ||||
| New York (NYC) | Acc@1 | 0.203 | 0.511 | +0.308 |
| Singapore (SIN) | Acc@1 | 0.198 | 0.444 | +0.246 |
| New York (NYC) | Acc@5 | 0.496 | 0.785 | +0.289 |
| Ablation study on LLM backbones shows that while stronger models perform better, the framework is effective across different models. | ||||
| New York (NYC) | Acc@1 | 0.457 | 0.511 | +0.054 |
| Cold-Start performance analysis (users with <15 check-ins) demonstrates robustness where traditional models fail. | ||||
| New York (NYC) | Acc@1 | 0.082 | 0.253 | +0.171 |