| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Main comparison on ReDial dataset showing USB-Rec dominance in iEval and competitive Recall@1. | ||||
| ReDial | iEval | 1.14 | 1.29 | +0.15 |
| ReDial | Recall@1 | 0.045 | 0.050 | +0.005 |
| Main comparison on OpenDialKG dataset. | ||||
| OpenDialKG | iEval | 1.06 | 1.40 | +0.34 |
| OpenDialKG | Recall@1 | 0.246 | 0.300 | +0.054 |
| Ablation study demonstrating the impact of adding SES to non-finetuned models. | ||||
| ReDial | iEval | 0.85 | 0.99 | +0.14 |