POMDP: Partially Observable Markov Decision Process—a mathematical framework for modeling decision-making where the agent does not fully observe the state
FuncEnv: Functional Environment—a stateless API design in Gymnasium where transitions are pure functions, enabling easier parallelization and hardware acceleration (e.g., JAX)
VectorEnv: Vectorized Environment—an abstraction running multiple environment instances in parallel to batch observations and actions, increasing training throughput
MuJoCo: Multi-Joint dynamics with Contact—a physics engine used for simulating complex robotics environments
Box2D: A 2D rigid-body physics engine used for simpler control tasks like Lunar Lander
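Truncation vs. Termination: Termination means the episode ended naturally because the environment reached a terminal state (e.g., game over); truncation means it was stopped artificially (e.g., time limit reached). The two require different handling when bootstrapping value estimates: after termination the value of the next state is zero, whereas after truncation the next state still has value and the estimate should bootstrap from it.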
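
The termination/truncation distinction can be illustrated with a one-step TD target. This is a minimal sketch, not taken from any library: the names td_target and needs_reset are hypothetical, and gamma and next_value are assumed to come from the surrounding training code. It shows only that the bootstrap term is dropped on termination and kept on truncation, while either flag ends the episode.

```python
def td_target(reward, next_value, terminated, gamma=0.99):
    """One-step TD target: r + gamma * V(s'), with V(s') = 0 at terminal states.

    Hypothetical helper for illustration; `next_value` is assumed to be the
    critic's estimate of the next state's value.
    """
    # Only true termination zeroes the bootstrap term. A truncated episode
    # (e.g. a time limit) still bootstraps from the next state's value.
    bootstrap = 0.0 if terminated else next_value
    return reward + gamma * bootstrap


def needs_reset(terminated, truncated):
    """Either flag ends the episode, so the environment must be reset."""
    return terminated or truncated


# Same transition, two different endings (gamma chosen for easy arithmetic).
target_if_terminated = td_target(1.0, next_value=10.0, terminated=True, gamma=0.5)
target_if_truncated = td_target(1.0, next_value=10.0, terminated=False, gamma=0.5)
```

Note that the truncated flag never enters the target computation; it only affects when the episode is reset. Treating a truncated step as terminated would bias value estimates downward near the time limit.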