| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Comparative analysis on GenAIBench shows SIDiffAgent outperforming both proprietary and open-source models. | ||||
| GenAIBench | VQA Score | 0.813 | 0.884 | +0.071 |
| GenAIBench | VQA Score | 0.839 | 0.884 | +0.045 |
| GenAIBench | VQA Score | 0.764 | 0.884 | +0.120 |
| GenAIBench | VQA Score | 0.841 | 0.884 | +0.043 |
| GenAIBench | VQA Score | 0.832 | 0.884 | +0.052 |