| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Preliminary studies (Motivation section) demonstrate the limitations of current approaches. | | | | |
| Average across 4 datasets | Answer Recall | 0.45 | 0.58 | +0.13 |
| CAmbigNQ | Precision | 0.22 | 0.38 | +0.16 |
| Not specified (general aggregate) | Response Time | Not reported | Not reported | Not reported |