← Back to Paper List

Policy Compiler for Secure Agentic Systems

Nils Palumbo, Sarthak Choudhary, Jihye Choi, Prasad Chalasani, Somesh Jha
University of Wisconsin–Madison
arXiv (2026)
Agent Benchmark

📝 Paper Summary

Agent Security Policy Enforcement Authorization
PCAS instruments agentic systems to enforce authorization policies deterministically by maintaining a fine-grained dependency graph of all actions and checking declarative rules before execution.
Core Problem
Current agent systems rely on prompt-based instructions for security, which offers no enforcement guarantees, while linear message logs fail to capture the causal dependencies needed for robust authorization.
Why it matters:
  • Prompt-based policies are ambiguous and easily bypassed by prompt injection or model error, leading to data exfiltration or unauthorized actions
  • Real-world authorization often depends on provenance (e.g., 'approve only if derived from X'), which linear logs obscure
  • Compliance violations in sectors like healthcare or customer service can have severe legal and safety consequences
Concrete Example: A policy states 'access medical records only after supervisor approval.' If an agent simply logs messages linearly, it might not track *which* specific approval causally preceded a request, or if the approval was faked via prompt injection. PCAS tracks the causal graph to ensure the approval event explicitly enables the access event.
Key Novelty
Policy Compiler for Agentic Systems (PCAS)
  • Compiles declarative policies (Datalog) and existing agent code into a secure system that intercepts every action via a reference monitor
  • Models the entire agent state (messages, tool calls, results) as a dependency graph rather than a linear log to track information flow and causal provenance
  • Enforces security deterministically outside the LLM, meaning the policy holds even if the LLM is compromised or hallucinates
Evaluation Highlights
  • Improves policy compliance from 48% to 93% on average across frontier models (Claude Opus 4.5, GPT-5.2, Gemini 3 Pro) in customer service tasks
  • Achieves 100% prevention of policy violations (0 violations allowed) in instrumented runs on customer service benchmarks
  • Effective against prompt injection: blocks unauthorized actions even when models are successfully manipulated by adversarial inputs
Breakthrough Assessment
8/10
Strong conceptual advance by moving security from 'prompt engineering' to deterministic runtime enforcement via dependency graphs. High impact for enterprise agent deployment.
×