| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance comparisons on the new UIS-QA benchmark show UIS-Digger outperforming strong baselines, despite all models struggling compared to standard benchmarks. | ||||
| UIS-QA | Accuracy | 25.45 | 27.27 | +1.82 |
| GAIA vs UIS-QA | Accuracy | 70.90 | 24.55 | -46.35 |