| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Comparison of MM1.5-30B-Chat against state-of-the-art proprietary and open-source models demonstrates competitive performance. | ||||
| MMBench | Accuracy | 83.4 | 86.6 | +3.2 |
| DocVQA | Accuracy | 92.8 | 91.0 | -1.8 |
| MMMU | Accuracy | 69.1 | 65.2 | -3.9 |
| Evaluation of small-scale models shows MM1.5-1B/3B outperforming similar-sized competitors. | ||||
| MMBench | Accuracy | 70.0 | 73.2 | +3.2 |
| Ablation study on Continual Pre-training resolution proves high resolution is critical. | ||||
| MMBase Score | Score | 58.28 | 60.26 | +1.98 |