| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| The recovered latent factors (z1, z2, z3) align strongly with specific benchmark categories. | ||||
| BBH | R^2 | 0 | 0.94 | 0.94 |
| IFEval | R^2 | 0 | 0.90 | 0.90 |
| MATH Lvl 5 | R^2 | 0 | 1.00 | 1.00 |
| Recovered Causal Structure shows specific edge weights between factors for different base models. | ||||
| Latent Graph | Edge Weight (z1 -> z2) | Not reported in the paper | 2.76 | Not reported in the paper |
| Latent Graph | Edge Weight (z2 -> z3) | Not reported in the paper | 0.27 | Not reported in the paper |
| Intervention experiments validate the causal link z2 (Instruction Following) -> z3 (Math). | ||||
| MATH | Accuracy | 0.29 | 0.32 | +0.03 |