| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Main results on the TRD dataset: BRIDGE is more accurate than the baselines across diverse scenarios. | ||||
| TRD (Overall) | Accuracy | 44.75 | 63.22 | +18.47 |
| TRD (Overall) | Accuracy | 51.17 | 63.22 | +12.05 |
| Performance on scenario-specific datasets demonstrates generalization and robustness. | ||||
| RealtimeQA | Accuracy | 66.52 | 68.23 | +1.71 |
| HotpotQA Poisoned | Accuracy | 46.10 | 52.80 | +6.70 |
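
The Δ column appears to be the absolute gain in accuracy points, that is, This Paper minus Baseline. The following is a minimal sketch, assuming that reading, that recomputes each Δ from the values in the table. The names `rows` and `check_deltas` and the tolerance `tol` are illustrative and do not come from the paper.

```python
# Rows copied from the table above:
# (benchmark, baseline accuracy, this paper's accuracy, reported delta).
rows = [
    ("TRD (Overall)", 44.75, 63.22, 18.47),
    ("TRD (Overall)", 51.17, 63.22, 12.05),
    ("RealtimeQA", 66.52, 68.23, 1.71),
    ("HotpotQA Poisoned", 46.10, 52.80, 6.70),
]


def check_deltas(rows, tol=0.005):
    """Recompute delta = this_paper - baseline for every row.

    Assumption: the reported delta is an absolute difference in accuracy
    points. Raises ValueError if a recomputed value disagrees with the
    reported one by more than `tol`.
    """
    deltas = []
    for name, baseline, this_paper, reported in rows:
        delta = round(this_paper - baseline, 2)
        if abs(delta - reported) > tol:
            raise ValueError(
                f"{name}: recomputed {delta}, table reports {reported}"
            )
        deltas.append(delta)
    return deltas


computed = check_deltas(rows)
for (name, baseline, _, _), delta in zip(rows, computed):
    print(f"{name} (baseline {baseline:.2f}): +{delta:.2f}")
```

Under this assumption all four reported Δ values agree with the Baseline and This Paper columns.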