Sim-to-Real: Transferring policies learned in a physics simulator to a physical robot
URDF: Unified Robot Description Format—an XML file format used to describe the physical structure (links, joints) of a robot
System Identification: The process of tuning simulation parameters (mass, friction, damping) to match real-world physics
SAM2: Segment Anything Model 2—a computer vision model used here to track and segment objects in video streams
PPO: Proximal Policy Optimization—the reinforcement learning algorithm used to train the robot's control policy
Domain Randomization: Varying simulation parameters (lighting, friction, object mass) during training to make the policy robust to real-world variations
Divide-and-Conquer Distillation: Training specialized policies for specific object subsets first, then using their data to train a single generalist policy