| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance on TR-Bench In-Domain and Out-of-Domain settings compared to strong baselines. | ||||
| TR-Bench (In-Domain) | NDCG@5 | 0.7578 | 0.8037 | +0.0459 |
| TR-Bench (In-Domain) | Recall@5 | 0.8650 | 0.9280 | +0.0630 |
| TR-Bench (Out-of-Domain) | NDCG@5 | 0.5510 | 0.6272 | +0.0762 |
| TR-Bench (Out-of-Domain) | Recall@5 | 0.6860 | 0.8250 | +0.1390 |
| Ablation study showing the impact of the iterative feedback mechanism. | ||||
| TR-Bench (In-Domain) | Recall@5 | 0.73 | 0.81 | +0.08 |
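
For readers unfamiliar with the two metrics reported above, the following is a minimal sketch of how Recall@5 and NDCG@5 are commonly computed for a single query. It assumes binary relevance labels, a log2 rank discount, and a ranked list without duplicate IDs; the exact definitions used for TR-Bench are not stated here and may differ. The function names `recall_at_k` and `ndcg_at_k` and the document IDs are illustrative only.

```python
import math


def recall_at_k(ranked_ids, relevant_ids, k=5):
    """Fraction of the relevant items that appear in the top-k results."""
    if not relevant_ids:
        return 0.0
    hits = sum(1 for doc_id in ranked_ids[:k] if doc_id in relevant_ids)
    return hits / len(relevant_ids)


def ndcg_at_k(ranked_ids, relevant_ids, k=5):
    """Binary-relevance NDCG@k with a log2 rank discount (an assumed variant)."""
    # DCG: each relevant item at 0-based position `rank` contributes 1 / log2(rank + 2).
    dcg = sum(
        1.0 / math.log2(rank + 2)
        for rank, doc_id in enumerate(ranked_ids[:k])
        if doc_id in relevant_ids
    )
    # Ideal DCG: all relevant items (at most k) placed at the top of the list.
    ideal_hits = min(len(relevant_ids), k)
    idcg = sum(1.0 / math.log2(rank + 2) for rank in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0


# Hypothetical single-query example: two of three relevant items are retrieved.
ranked = ["d3", "d1", "d7", "d2", "d9"]
relevant = {"d1", "d2", "d4"}
print(f"Recall@5 = {recall_at_k(ranked, relevant):.4f}")
print(f"NDCG@5   = {ndcg_at_k(ranked, relevant):.4f}")
```

Benchmark-level scores such as those in the table are typically the mean of these per-query values, and the Δ column is the difference between the "This Paper" and "Baseline" columns.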