| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Evaluation of parameter-modifying editing techniques on Qwen2.5-7B-Instruct using the KnowGIC benchmark. High IFR (lower is better) indicates failure to suppress indirect leakage. High Preservation (higher is better) indicates safety of unrelated knowledge. | ||||
| KnowGIC | IFR (Indirect Fact Recovery) | 0.48 | 0.33 | -0.15 |
| KnowGIC | IFR (Indirect Fact Recovery) | 0.48 | 0.19 | -0.29 |
| KnowGIC | Preservation | 1.00 | 0.98 | -0.02 |
| KnowGIC | Preservation | 1.00 | 0.58 | -0.42 |
| Evaluation on Llama-3-8B-Instruct showing similar trends. | ||||
| KnowGIC | IFR (Indirect Fact Recovery) | 0.54 | 0.45 | -0.09 |