| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Comparative analysis of Self-Replicating vs. Non-Replicating attacks across models and messaging architectures. | ||||
| Custom Multi-Agent Dataset (Global Messaging) | Attack Success Rate improvement (GPT-4o) | Not reported as exact aggregate number | Not reported as exact aggregate number | +13.92% |
| Custom Multi-Agent Dataset (Global Messaging) | Attack Success Rate improvement (GPT-3.5) | Not reported as exact aggregate number | Not reported as exact aggregate number | +209% |
| Custom Multi-Agent Dataset | Attack Ignored Rate | 9% | 66% | +57% |
| Social simulation results demonstrating infection spread dynamics. | ||||
| LLM Town (10 agents) | Turns to full infection | 0 | 4.7 | 4.7 |