CBF: Control Barrier Function—a mathematical function defining a safe set of states; if the state stays within the set, the system is safe.
Safety Filter: An algorithm (usually a QP) that modifies a proposed control action minimally to ensure it satisfies safety constraints.
PPO: Proximal Policy Optimization—a standard policy-gradient reinforcement learning algorithm.
DTCBF: Discrete-Time Control Barrier Function—a version of CBFs adapted for systems with discrete time steps.
QP: Quadratic Program—an optimization problem with a quadratic objective and linear constraints, used here to find the safest action closest to the desired one.
Forward Invariant: A property of a set where if a system starts inside the set, it will remain inside it for all future time.
Reduced-Order Model: A simplified physics model (e.g., inverted pendulum for a humanoid) used to make control calculations tractable.