| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance on RealScholarQuery (real-world queries): PaSa-7B outperforms the search-engine and larger-model baselines on every reported recall metric. | ||||
| RealScholarQuery | Recall@20 | See delta | See delta | +37.78% |
| RealScholarQuery | Recall@50 | See delta | See delta | +39.90% |
| RealScholarQuery | Recall | See delta | See delta | +30.36% |
| Performance on AutoScholarQuery (synthetic test set) confirming the trends seen in real-world data. | ||||
| AutoScholarQuery Test | Recall@20 | See delta | See delta | +34.05% |
| AutoScholarQuery Test | Recall@50 | See delta | See delta | +39.36% |
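
For readers unfamiliar with the metrics in the table, the following is a minimal sketch of how Recall@k is conventionally computed for a single query: the fraction of relevant papers that appear in the top k results. The function name `recall_at_k`, its parameters, and the paper IDs are hypothetical and chosen for illustration; this is not the paper's evaluation code, and the table does not state whether Δ is an absolute or a relative difference, so the sketch only covers the metric itself.

```python
from typing import Iterable, Optional, Sequence


def recall_at_k(
    ranked_ids: Sequence[str],
    relevant_ids: Iterable[str],
    k: Optional[int] = None,
) -> float:
    """Fraction of relevant items found in the top-k of a ranked list.

    With k=None the whole ranked list is used, which corresponds to a
    plain "Recall" column (an assumption about how that column is defined).
    """
    relevant = set(relevant_ids)
    if not relevant:
        raise ValueError("relevant_ids must be non-empty")
    top = ranked_ids if k is None else ranked_ids[:k]
    # Count relevant items that were retrieved, ignoring duplicates in the ranking.
    hits = relevant.intersection(top)
    return len(hits) / len(relevant)


# Hypothetical example: 3 relevant papers, 5 retrieved results.
ranked = ["p3", "p7", "p1", "p9", "p4"]
relevant = {"p1", "p4", "p8"}

recall_top3 = recall_at_k(ranked, relevant, k=3)   # only "p1" is in the top 3
recall_full = recall_at_k(ranked, relevant)        # "p1" and "p4" are retrieved
```

A benchmark-level score would typically average this per-query value over all queries in the test set.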