Proprioception: Sensing the robot's own internal state (joint positions, body orientation) without external sensors like cameras
Exteroception: Sensing the external environment (e.g., via LiDAR or cameras)
CENet: Context-Aided Estimator Network—the proposed neural network module that estimates both body velocity and terrain context
AdaBoot: Adaptive Bootstrapping—a training technique that tunes the probability of using estimated vs. ground-truth states based on learning stability
Asymmetric Actor-Critic: RL architecture where the Critic (value function) sees privileged ground-truth info while the Actor (policy) sees only partial observations
RMA: Rapid Motor Adaptation—a baseline method that adapts to terrain by encoding recent history into a latent vector
VAE: Variational Auto-Encoder—a generative model used here to learn a compressed representation of the terrain by reconstructing observations
PPO: Proximal Policy Optimization—the reinforcement learning algorithm used to train the policy
Privileged observations: Information available only in simulation (e.g., ground truth terrain friction, exact body velocity) used to train the Critic
ELU: Exponential Linear Unit—activation function used in the neural networks