← Back to Paper List

TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems

Shaina Raza, Ranjan Sapkota, Manoj Karkee, Christos Emmanouilidis
Not reported in the paper
arXiv (2025)
Agent Memory Benchmark

📝 Paper Summary

AI Governance Trustworthy AI Agentic Multi-Agent Systems (AMAS)
This review proposes a unified TRiSM (Trust, Risk, and Security Management) framework specifically adapted for Agentic Multi-Agent Systems, introducing new metrics to measure agent synergy and tool utilization.
Core Problem
Current AI governance frameworks focus on general ML or single models, failing to address the unique system-level risks of autonomous, coordinating multi-agent systems (AMAS) such as cascading errors, tool abuse, and emergent misbehavior.
Why it matters:
  • Multi-agent systems exhibit opaque, emergent behaviors that single-model safety checks cannot detect
  • The integration of autonomous planning, memory, and external tool use expands the attack surface significantly beyond traditional ML
  • Existing frameworks (like NIST AI RMF) lack specific controls for inter-agent coordination and dynamic decision provenance
Concrete Example: In a collaborative setting, an agent might experience a 'collusive failure' where one agent's hallucinated output is accepted and amplified by another agent without verification, leading to a compounded error that no single agent would have produced in isolation.
Key Novelty
AMAS-specific TRiSM Framework & Metrics
  • Adapts the AI TRiSM framework (Explainability, ModelOps, Security, Privacy, Governance) specifically for the architectural nuances of multi-agent loops
  • Proposes two novel metrics: Component Synergy Score (CSS) to quantify how well agents enable each other, and Tool Utilization Efficacy (TUE) to measure the correctness and efficiency of external tool calls
Architecture
Architecture Figure Figure 9
A comprehensive architecture for TRiSM-aligned Agentic Multi-Agent Systems, highlighting governance components alongside functional agent modules.
Evaluation Highlights
  • The paper is a review and framework proposal; it does not report empirical performance results on benchmarks.
  • Proposes the Component Synergy Score (CSS) metric to measure inter-agent collaboration quality
  • Proposes the Tool Utilization Efficacy (TUE) metric to evaluate the correctness of tool invocations
Breakthrough Assessment
7/10
A comprehensive conceptual framework that fills a critical gap in agentic AI governance. While it lacks empirical validation of the proposed metrics, the taxonomy and adapted TRiSM pillars provide a necessary roadmap for future secure deployments.
×