| Benchmark | Metric | Baseline | This Paper | Δ (This Paper - Baseline) |
|---|---|---|---|---|
| Logic Probes (LLaMA-2-7B) | Pearson Correlation (r) | 0 | -0.91 | -0.91 |
| Mistral-7B Training | Roughness Drop | 1.00 | 0.63 | -0.37 |
| Format Probes (Mistral-7B) | Probe Margin | 0.0 | 4.44 | +4.44 |
| Format Probes (LLaMA-2-7B) | Probe Margin | 0 | -2.0 | -2.0 |