| Benchmark | Metric | Baseline | This Paper | Δ (vs. Baseline) |
|---|---|---|---|---|
| DeCapBench (Human Expert Subset) | Pearson Correlation (PCC) | Not reported in the paper | Not reported in the paper | +0.2375 |
| DeCapBench (Human Expert Subset) | Kendall's Tau | Not reported in the paper | Not reported in the paper | +0.1082 |
| VLM Arena (Description Task) | Spearman Correlation | Not reported in the paper | 0.90 | Not reported in the paper |
| mmHal-V | Hallucination Rate (Relative Reduction) | Not reported in the paper | Not reported in the paper | -40.5% |