SimVG achieves state-of-the-art accuracy on the RefCOCO, RefCOCO+, and RefCOCOg datasets, particularly excelling with the ViT-L backbone.

| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| RefCOCOg (val) | Accuracy | 85.87 | 88.03 | +2.16 |
| ReferIt (test) | Accuracy | 76.38 | 80.70 | +4.32 |
| RefCOCOg (val) | Accuracy | 86.53 | 88.03 | +1.50 |
| RefCOCOg (val) | Accuracy | 85.64 | 88.03 | +2.39 |