TSC: Traffic Signal Control—optimizing traffic lights to minimize congestion
MARL: Multi-Agent Reinforcement Learning—multiple autonomous agents learning to interact in a shared environment
CTDE: Centralized Training with Decentralized Execution—agents learn using global information but act using only local views
MAPPO: Multi-Agent Proximal Policy Optimization—an RL algorithm adapting PPO for multi-agent settings with a centralized critic
Vissim: A high-fidelity microscopic traffic simulator used for realistic modeling of driver behavior
Green Split: The allocation of green signal duration among different traffic phases within a cycle
Dec-POMDP: Decentralized Partially Observable Markov Decision Process—a mathematical framework for multi-agent decision making under uncertainty and partial visibility
sim-to-real: Transferring policies learned in simulation to the real world
green wave: Coordinating signals so platoons of vehicles hit a sequence of green lights without stopping