| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| KEDiT outperforms the baseline on the general open-domain benchmark (+1.7 ROUGE-L, +2.5 F1). | ||||
| Wizard of Wikipedia (Test Seen) | ROUGE-L | 35.8 | 37.5 | +1.7 |
| Wizard of Wikipedia (Test Seen) | F1 | 37.1 | 39.6 | +2.5 |
| Gains on the specialized medical domain are larger (+2.7 ROUGE-L, +2.6 BLEU-2), supporting the method's use for domain-specific knowledge. | ||||
| PubMed-Dialog | ROUGE-L | 32.4 | 35.1 | +2.7 |
| PubMed-Dialog | BLEU-2 | 11.2 | 13.8 | +2.6 |
| Ablation studies on PubMed-Dialog indicate that both the compression (Info Bottleneck) and the specific adapter architecture are needed: each ablated configuration scores below the full model's 35.1 ROUGE-L. | ||||
| PubMed-Dialog | ROUGE-L | 33.9 | 35.1 | +1.2 |
| PubMed-Dialog | ROUGE-L | 34.2 | 35.1 | +0.9 |
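
The Δ column can be re-derived from the Baseline and This Paper columns. The sketch below is a minimal consistency check, not part of the paper's evaluation code: the row values are copied from the table, while the names `ROWS`, `delta`, `relative_gain`, and `check_rows` are hypothetical helpers introduced only for illustration. The relative gain is an extra derived quantity that the table does not report.

```python
# Rows copied from the table: (benchmark, metric, baseline, this_paper, reported_delta).
# The last two rows are the ablation rows; the table does not label which
# component each one removes, so they are left unlabeled here as well.
ROWS = [
    ("Wizard of Wikipedia (Test Seen)", "ROUGE-L", 35.8, 37.5, 1.7),
    ("Wizard of Wikipedia (Test Seen)", "F1", 37.1, 39.6, 2.5),
    ("PubMed-Dialog", "ROUGE-L", 32.4, 35.1, 2.7),
    ("PubMed-Dialog", "BLEU-2", 11.2, 13.8, 2.6),
    ("PubMed-Dialog", "ROUGE-L", 33.9, 35.1, 1.2),
    ("PubMed-Dialog", "ROUGE-L", 34.2, 35.1, 0.9),
]


def delta(baseline: float, ours: float) -> float:
    """Absolute improvement in metric points, rounded to one decimal."""
    return round(ours - baseline, 1)


def relative_gain(baseline: float, ours: float) -> float:
    """Improvement as a percentage of the baseline score (not in the table)."""
    return round(100.0 * (ours - baseline) / baseline, 1)


def check_rows(rows):
    """Return the rows whose reported delta disagrees with the recomputed one."""
    return [row for row in rows if delta(row[2], row[3]) != row[4]]


if __name__ == "__main__":
    for benchmark, metric, base, ours, reported in ROWS:
        print(f"{benchmark:32s} {metric:8s} "
              f"delta={delta(base, ours):+.1f} "
              f"relative={relative_gain(base, ours):+.1f}%")
    print("inconsistent rows:", check_rows(ROWS))
```

Rounding to one decimal before comparing avoids spurious mismatches from floating-point subtraction, since the table itself reports one decimal place.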