MAS: Multi-Agent Systems—systems consisting of multiple independent interactive decision makers (agents) like robots, software units, or LLMs
CTDE: Centralized Training with Decentralized Execution—a paradigm where agents learn using a global view during training but act using only local information during deployment
MAPF: Multi-Agent Path Finding—the problem of finding collision-free paths for multiple agents from start to goal locations
Stigmergy: Indirect coordination where agents communicate by modifying their shared environment (e.g., ants leaving pheromones) rather than direct messaging
MAPPO: Multi-Agent Proximal Policy Optimization—an algorithm applying PPO to multi-agent settings, typically using CTDE
VDN: Value-Decomposition Networks—a method that decomposes a global team reward into individual agent value functions
QMIX: A value-decomposition method that allows for non-linear combinations of agent values, improving upon VDN
LLM-based MAS: Multi-Agent Systems where the agents are Large Language Models capable of reasoning, planning, and natural language communication
Coordination Graph: A graphical representation where nodes are agents and edges represent dependencies or communication channels