| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Zero-shot scene classification demonstrates GeoChat's superior domain adaptation compared to general-purpose VLMs. | ||||
| UCMerced | Accuracy | 68.00 | 84.43 | +16.43 |
| AID | Accuracy | 51.00 | 72.03 | +21.03 |
| VQA results show GeoChat competes with specialist models while remaining a generalist. | ||||
| RSVQA-LRBEN | Average Accuracy | 92.29 | 90.70 | -1.59 |
| RSVQA-HRBEN (Test set 2) | Average Accuracy | 68.40 | 72.30 | +3.90 |
| Grounding and region captioning results highlight the model's spatial reasoning capabilities. | ||||
| GeoChat Benchmark | METEOR | 10.0 | 83.9 | +73.9 |
| GeoChat Benchmark | Accuracy@0.5 | 9.1 | 16.0 | +6.9 |
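
The Δ column appears to be the plain difference "This Paper" minus "Baseline", so a negative value means the baseline scored higher. As a minimal sketch, the snippet below recomputes Δ from the values in the table; the names `rows`, `delta`, and `deltas` are illustrative and do not come from the paper.

```python
# Rows copied from the table above: (benchmark, metric, baseline, this_paper).
rows = [
    ("UCMerced", "Accuracy", 68.00, 84.43),
    ("AID", "Accuracy", 51.00, 72.03),
    ("RSVQA-LRBEN", "Average Accuracy", 92.29, 90.70),
    ("RSVQA-HRBEN (Test set 2)", "Average Accuracy", 68.40, 72.30),
    ("GeoChat Benchmark", "METEOR", 10.0, 83.9),
    ("GeoChat Benchmark", "Accuracy@0.5", 9.1, 16.0),
]


def delta(baseline: float, ours: float, ndigits: int = 2) -> float:
    """Return ours - baseline, rounded to hide floating-point noise."""
    return round(ours - baseline, ndigits)


# Map (benchmark, metric) -> delta, assuming delta = this_paper - baseline.
deltas = {(bench, metric): delta(base, ours) for bench, metric, base, ours in rows}

for (bench, metric), d in deltas.items():
    # The explicit sign makes regressions (negative deltas) easy to spot.
    print(f"{bench:<26} {metric:<17} {d:+.2f}")
```

Under that assumption, the only negative entry is RSVQA-LRBEN, where the baseline leads by 1.59 points.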