AR: Autoregressive—generating data one step at a time, where each step depends on previous ones.
NAR: Non-Autoregressive—generating multiple data points in parallel to increase speed.
World Model: A learned internal representation of the environment's dynamics, allowing an agent to simulate futures.
Tokenization: Converting continuous data (like vehicle coordinates) into discrete tokens that a language model can process.
GCA: Gated Cross Attention—a mechanism to selectively fuse information from different modalities (e.g., map data and agent tracks).
Rollout: Simulating a sequence of future steps starting from a current state to estimate outcomes.
MDP: Markov Decision Process—a mathematical framework for modeling decision making where outcomes are partly random and partly under the control of a decision maker.
SAC: Soft Actor-Critic—an off-policy reinforcement learning algorithm that optimizes a stochastic policy to maximize expected reward and entropy.