| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance gains on the newly constructed SVIP-Test benchmark using the proposed data tuning. | | | | |
| SVIP-Test | Accuracy Improvement | 0.0 | 6.3 | +6.3 |
| SVIP-Test | Accuracy Improvement | 0.0 | 2.3 | +2.3 |
| Impact of the specialized TriAtt-CoT reward architecture compared to standard tuning. | | | | |
| SVIP-Test | Accuracy Improvement | 0.0 | 5.95 | +5.95 |