| Benchmark | Metric | Baseline | This Paper | Δ (absolute) |
|---|---|---|---|---|
| Retrieval performance comparisons show the proposed MPG method outperforms baselines on Yelp data. | ||||
| Yelp (Philadelphia) | Recall@20 | 0.0543 | 0.0664 | +0.0121 |
| Yelp (Philadelphia) | Precision@20 | 0.0033 | 0.0039 | +0.0006 |
| Re-ranking experiments demonstrate that adding an LLM ranker improves over pure retrieval. | ||||
| Yelp (Philadelphia) | Recall@3 | 0.0124 | 0.0381 | +0.0257 |
| Yelp (Philadelphia) | Precision@3 | 0.0051 | 0.0152 | +0.0101 |
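
For readers unfamiliar with the metrics in the table, the following is a minimal sketch of how Recall@K and Precision@K are conventionally computed for a single user, and how the Δ column follows from the two score columns. It assumes the standard textbook definitions; the paper's exact evaluation protocol (for example, how scores are averaged over users) is not stated here. The function names `precision_at_k`, `recall_at_k`, and `absolute_delta`, and the item IDs in the usage lines, are hypothetical and chosen only for illustration.

```python
from typing import Hashable, Sequence, Set


def precision_at_k(ranked: Sequence[Hashable], relevant: Set[Hashable], k: int) -> float:
    """Fraction of the top-k ranked items that are relevant (hits / k)."""
    if k <= 0:
        raise ValueError("k must be positive")
    # Assumes `ranked` contains no duplicate items.
    hits = sum(1 for item in ranked[:k] if item in relevant)
    return hits / k


def recall_at_k(ranked: Sequence[Hashable], relevant: Set[Hashable], k: int) -> float:
    """Fraction of all relevant items that appear in the top-k (hits / |relevant|)."""
    if k <= 0:
        raise ValueError("k must be positive")
    if not relevant:
        # Convention assumed here: a user with no relevant items scores 0.
        return 0.0
    hits = sum(1 for item in ranked[:k] if item in relevant)
    return hits / len(relevant)


def absolute_delta(baseline: float, proposed: float, ndigits: int = 4) -> float:
    """Absolute difference (proposed - baseline), rounded like the table's Δ column."""
    return round(proposed - baseline, ndigits)


# Illustrative usage with made-up item IDs (not data from the paper).
ranked_items = ["b1", "b2", "b3", "b4"]
relevant_items = {"b2", "b9"}
p3 = precision_at_k(ranked_items, relevant_items, 3)  # 1 hit in the top 3
r3 = recall_at_k(ranked_items, relevant_items, 3)     # 1 of 2 relevant items found

# Reproducing the Δ of the first table row from its two score columns.
delta_recall20 = absolute_delta(0.0543, 0.0664)
```

Under these definitions, the Δ values in the table are plain differences between the "This Paper" and "Baseline" columns, not relative improvements.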