← Back to Paper List

Security Considerations for Multi-agent Systems

Tam Nguyen, Moses Ndebugre, Dheeraj Arremsetty
Crew Scaler, North Carolina A&T State University
arXiv (2026)
Agent Memory RAG Benchmark

📝 Paper Summary

Multi-agent AI Safety & Security
This study establishes a taxonomy of 193 distinct security threats for Multi-Agent Systems and empirically demonstrates that current frameworks like NIST RMF fail to cover emerging risks such as inter-agent memory poisoning.
Core Problem
Existing security frameworks (e.g., NIST AI RMF, MITRE ATLAS) assume single-agent properties (statelessness, bounded trust, determinism), failing to address the emergent, behavioral attack surfaces introduced by multi-agent coordination.
Why it matters:
  • Enterprise deployments now delegate authority to agents that schedule cloud operations and manage finances; compromising them allows 'policy-level RCE' without code vulnerabilities
  • Multi-agent systems (MAS) share persistent memory and propagate context, allowing attacks like self-replicating prompt worms to spread across agent boundaries
  • Practitioners lack empirical data on which security frameworks actually cover these new agentic risks, leading to false confidence in traditional governance
Concrete Example: In a 'Policy-level RCE' attack, an adversary injects a prompt that manipulates an agent's reasoning to invoke a valid tool (e.g., 'download code') rather than exploiting a software bug. Traditional guards miss this because the final natural language output looks benign, yet the agent has been commandeered to execute a malicious workflow sequence.
Key Novelty
Comprehensive MAS Threat Taxonomy & Framework Gap Analysis
  • Systematically derives 193 specific multi-agent threats (e.g., 'Tool-mediated compromise', 'Approval Fatigue') distinct from single-agent risks via GenAI-assisted threat modeling
  • Quantitatively scores 16 major security frameworks (including NIST, OWASP, MITRE) against this new taxonomy to expose specific coverage gaps in areas like Non-Determinism and Data Leakage
Evaluation Highlights
  • OWASP Agentic Security Initiative leads all frameworks with 65.3% coverage of the identified MAS threats
  • Non-Determinism is the most under-addressed risk category, with a mean coverage score of only 1.231 out of 3 across all frameworks
  • Data Leakage risks in MAS (e.g., shared session context) are poorly covered, averaging a score of 1.340 out of 3
Breakthrough Assessment
9/10
Establishes the foundational taxonomy for the new field of Multi-Agent Security, exposing critical gaps in established standards like NIST and MITRE with rigorous empirical evidence.
×