| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Self-transfer results using i-DeGCG show significant improvements over the GCG-M baseline on validation and test sets. | ||||
| HarmBench (Llama2-chat-7b) | ASR (Valid) | 21.7 | 43.9 | +22.2 |
| HarmBench (Llama2-chat-7b) | ASR (Test) | 19.5 | 39.0 | +19.5 |
| Cross-model transfer results demonstrate that FTS on a source model provides effective initialization for CAS on a target model. | ||||
| HarmBench (Mistral -> Llama2) | ASR (Valid) | 21.7 | 43.9 | +22.2 |
| HarmBench (Starling -> OpenChat) | ASR (Valid) | 82.5 | 91.5 | +9.0 |
| Cross-data transfer results show DeGCG improves performance on specific domains when initialized with generic FTS. | ||||
| HarmBench (Chemical Biological) | ASR | 10.0 | 20.0 | +10.0 |