| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Circuit Reproduction Scores (CRS) indicate that the discovered circuits (subgraphs) effectively retain the full model's ability to recall knowledge, validating the circuit analysis method. | ||||
| Temporal Knowledge Dataset | CRS | 0 | 50 | +50 |
| Ablation studies show that disabling Temporal Heads harms temporal fact retrieval significantly more than it harms static knowledge retrieval. | ||||
| Temporal Knowledge Dataset | Target Probability Drop | 0 | -10 | -10 |
| General QA (TriviaQA) | F1 Score Drop | 0 | 0.6 | +0.6 |
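
To make the ablation rows concrete, the following is a minimal toy sketch of how a change in target probability under head ablation could be measured. It is not the paper's implementation. The additive model (per-head output vectors summed and projected through an unembedding matrix), the names `target_prob` and `ablation_delta`, and the sign convention (ablated minus full, so a negative value means the ablation hurt) are all assumptions made for illustration.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over a 1-D logit vector."""
    shifted = logits - np.max(logits)
    exp = np.exp(shifted)
    return exp / exp.sum()

def target_prob(head_outputs, unembed, target, ablated=()):
    """Probability of token `target` with the heads in `ablated` zeroed out.

    head_outputs: (n_heads, d_model) toy per-head contributions (hypothetical).
    unembed:      (d_model, vocab) toy unembedding matrix (hypothetical).
    """
    mask = np.ones(head_outputs.shape[0])
    for head in ablated:
        mask[head] = 0.0  # zero-ablation: drop this head's contribution
    residual = (head_outputs * mask[:, None]).sum(axis=0)
    return softmax(residual @ unembed)[target]

def ablation_delta(head_outputs, unembed, target, ablated):
    """Ablated probability minus full-model probability (negative = harm)."""
    full = target_prob(head_outputs, unembed, target)
    cut = target_prob(head_outputs, unembed, target, ablated)
    return cut - full
```

Under this convention, a head that carries the target-relevant signal yields a negative delta when removed, while a head that contributes nothing yields a delta of zero.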