| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| *Main comparison on the True-False dataset, showing cross-domain accuracy improvements.* | | | | |
| True-False Dataset | Accuracy | 73.1 | 84.5 | +11.4 |
| *Main comparison on the LogicStruct dataset, showing robustness to logical variations.* | | | | |
| LogicStruct Dataset | Accuracy | 73.7 | 78.9 | +5.2 |
| *Analysis of structural consistency improvements.* | | | | |
| True-False Dataset (cities vs companies) | Cosine Similarity | 0.26 | 0.77 | +0.51 |
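
The Δ column appears to be the "This Paper" value minus the "Baseline" value. A minimal sketch for re-deriving it from the rows above is given here; the names `ROWS`, `delta`, and `check_rows` are illustrative and not part of the paper, and the values are copied from the table as strings so that decimal arithmetic stays exact.

```python
from decimal import Decimal

# Rows copied from the table: (benchmark, metric, baseline, this_paper, reported_delta).
# Values are kept as strings so Decimal parses them exactly (no float rounding).
ROWS = [
    ("True-False Dataset", "Accuracy", "73.1", "84.5", "+11.4"),
    ("LogicStruct Dataset", "Accuracy", "73.7", "78.9", "+5.2"),
    ("True-False Dataset (cities vs companies)", "Cosine Similarity", "0.26", "0.77", "+0.51"),
]


def delta(baseline: str, this_paper: str) -> Decimal:
    """Assumed definition of the delta column: this_paper - baseline."""
    return Decimal(this_paper) - Decimal(baseline)


def check_rows(rows):
    """Return the rows whose reported delta differs from the recomputed one."""
    mismatches = []
    for benchmark, metric, baseline, this_paper, reported in rows:
        computed = delta(baseline, this_paper)
        if computed != Decimal(reported):
            mismatches.append((benchmark, metric, reported, str(computed)))
    return mismatches


if __name__ == "__main__":
    for benchmark, metric, baseline, this_paper, reported in ROWS:
        print(f"{benchmark} [{metric}]: reported {reported}, recomputed {delta(baseline, this_paper):+}")
    print("mismatches:", check_rows(ROWS))
```

Under that assumed definition, `check_rows(ROWS)` returns an empty list, so the reported deltas are consistent with the baseline and "This Paper" columns.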