| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Document parsing benchmarks show GLM-OCR achieving top-tier performance despite its small size, particularly in general document structure recovery. | | | | |
| OmniDocBench v1.5 | Overall Score | 94.5 | 94.6 | +0.1 |
| OCRBench (Text) | Score | 75.3 | 94.0 | +18.7 |
| PubTabNet | Score | 88.4 | 85.2 | -3.2 |
| In Key Information Extraction (KIE), GLM-OCR outperforms open-source baselines and even surpasses some proprietary models. | | | | |
| Nanonets-KIE | Score | 87.5 | 93.7 | +6.2 |
| Receipt KIE (In-house) | Score | 83.5 | 94.5 | +11.0 |