MCP: Model Context Protocol—an open standard that enables consistent connection between AI assistants and systems (data, tools, prompts).
Fuzzy Instructions: Task descriptions that state high-level goals without specifying tool names or execution steps, requiring the agent to infer the workflow.
Dependency Chain: A sequence of tool invocations where the output of one tool is required as the input for a subsequent tool.
POMDP: Partially Observable Markov Decision Process—a mathematical framework for modeling decision-making where the agent cannot directly observe the full state of the environment.
LLM-as-a-Judge: Using a strong LLM to evaluate the quality of another model's outputs based on specific rubrics.
Schema Compliance: Adherence to the formal structure (data types, required fields) defined by a tool's API specification.
Distractor Servers: Additional MCP servers provided to the agent that are irrelevant to the current task, testing the agent's ability to filter noise.