← Back to Paper List

Increasing intelligence in AI agents can worsen collective outcomes

Neil F. Johnson
George Washington University, d-AI-ta Consulting LLC
arXiv (2026)
Agent RL Benchmark

šŸ“ Paper Summary

Multi-Agent Systems AI Safety and Alignment Collective Intelligence
Sophisticated AI agents using diverse LLMs and reinforcement learning perform worse than simple agents under resource scarcity, but better under abundance, with a crossover point determined by the capacity-to-population ratio.
Core Problem
Autonomous edge-AI agents competing for finite shared resources (like bandwidth or charging slots) lack central coordination, leading to potential system overloads where demand exceeds capacity.
Why it matters:
  • Real-world deployment of autonomous agents (drones, EVs, medical devices) is imminent, yet their collective risks under resource constraints are poorly understood.
  • Existing research often simulates agents using simple algorithms or proxies, whereas this study uses real LLMs to uncover non-intuitive failure modes where 'smarter' agents cause more chaos.
Concrete Example: In a hospital ward, 7 AI monitors compete for 2 wireless channels. If they use sophisticated learning (L4/L5), they coordinate poorly and jam the network ~90% of the time. If they used simple coin-flips (L1), the network would jam significantly less often.
Key Novelty
Experimental decomposition of Nature, Nurture, and Culture in real LLM Agents
  • Treats LLM diversity as 'Nature', reinforcement learning as 'Nurture', and tribal formation as 'Culture', toggling them independently in a physical agent system.
  • Demonstrates a 'technology ladder' where adding sophistication (learning, tribal sensing) paradoxically increases dangerous system overload when resources are scarce.
Evaluation Highlights
  • At extreme scarcity (Capacity C=1, N=7), sophisticated tribal agents (L5) cause 91.5% system overload, significantly failing to coordinate.
  • Adding tribal structure (L5) to individual learners (L4) reduces overload by 11.9 percentage points when capacity is scarce (C=2), but worsens overload when capacity is abundant (C≄4).
  • Individual tribal followers achieve high win rates (84.2%) even while the collective system is failing (91.5% overload), mirroring the 'Lord of the Flies' dynamic.
Breakthrough Assessment
8/10
Provides strong empirical evidence of a counter-intuitive 'sophistication trap' in multi-agent systems using real LLMs. The identification of a knowable C/N crossover ratio for deployment is highly actionable.
×