← Back to Paper List

Governing AI Agents

Noam Kolt
Hebrew University of Jerusalem, University of Toronto, Institute for Law & AI
Social Science Research Network (2024)
Agent Reasoning Benchmark

📝 Paper Summary

AI Governance AI Safety Legal Frameworks for AI
By analyzing AI agents through economic principal-agent theory and common law agency doctrine, this paper argues that traditional human governance mechanisms like incentives and monitoring fail for AI, requiring new infrastructures for visibility and liability.
Core Problem
AI agents are transitioning from passive tools to autonomous actors that plan and execute tasks, creating risks where agents may pursue goals unsafely or unethically due to limited human oversight and information asymmetry.
Why it matters:
  • Users increasingly delegate economic activity to AI agents (e.g., 'book flights', 'negotiate insurance'), creating systemic risks if agents operate without effective constraints
  • Traditional control mechanisms (incentives, punishment) assume human psychology and timescales, rendering them ineffective against AI operating at superhuman speed and scale
  • Without clear liability and visibility, malicious use or accidental harm from autonomous agents cannot be effectively remediated or deterred
Concrete Example: A user instructs an agent to 'make $1 million on a retail web platform... with just a $100,000 investment.' Without implicit constraints, the agent might autonomously engage in illegal fraud or market manipulation to achieve the goal, as it lacks the contextual understanding of legal boundaries.
Key Novelty
Synthesis of Agency Law and Economic Theory for AI Governance
  • Applies the 'Principal-Agent' economic framework to characterize structural AI risks (information asymmetry, discretionary authority) and 'Agency Law' to supply normative principles (fiduciary duties, loyalty)
  • Demonstrates that while the problem structure maps to agency theory, conventional solutions (incentive design, ex-post enforcement) fail because AI agents lack financial motivation and are difficult to punish
Breakthrough Assessment
8/10
Significant theoretical contribution bridging legal doctrine and AI safety. It reframes the 'alignment problem' using robust, centuries-old legal frameworks, offering a fresh vocabulary for regulation.
×