← Back to Paper List

Toward an Agentic Infused Software Ecosystem

Mark Marron
University of Kentucky
arXiv (2026)
Agent Reasoning Benchmark

📝 Paper Summary

Software Engineering for AI Agents Programming Language Design Agentic Workflows
The paper proposes the Agentic Infused Software Ecosystem (AISE), a holistic redesign of the software stack (language, tools, runtime) to support AI agents through explicit intents, mechanized validation, and safety.
Core Problem
Current software ecosystems are designed for humans, causing AI agents to struggle with implicit context, tool discovery, and safety, leading to errors when agents must infer hidden behaviors or manage large context windows.
Why it matters:
  • Agents face a 'long tail' of errors due to implicit behaviors and special case semantics in traditional languages.
  • Managing context windows for tool discovery is difficult; agents must guess relevant information if it's not explicit in API signatures.
  • Current human-AI cooperation relies on inefficient manual testing and code review rather than mechanized specification.
Concrete Example: An API `wait(duration: Int)` forces an agent to guess the time unit or retrieve documentation (wasting context tokens), whereas a strongly typed `wait(duration: MilliSeconds)` explicitly encodes the intent and requirement in the signature itself.
Key Novelty
Agentic Infused Software Ecosystem (AISE)
  • Co-designing the programming language (Bosque), tooling, and runtime specifically for agentic needs rather than adapting human-centric tools.
  • Introducing `agent` and `api` language keywords to explicitly distinguish between stochastic agent calls and deterministic workflow invocations.
  • Using strong type aliases with invariants (e.g., regex validation on types) to make code intent explicit and reduce the need for agents to generate defensive logic.
Breakthrough Assessment
7/10
Proposes a fundamental shift in software stack design to accommodate agents. While visionary and theoretically grounded in the Bosque language, it lacks empirical evaluation or a deployed implementation in the paper.
×