extrinsic dexterity: Using environmental features (gravity, walls, other objects) to manipulate an object, rather than relying solely on the robot's gripper
prehensile manipulation: Manipulation involving grasping (holding) the object
non-prehensile manipulation: Manipulation without grasping, such as pushing, poking, or toppling
impedance controller: A control strategy that manages the relationship between force and position, allowing the robot to be compliant (spring-like) during contact
world model: A learned neural network that predicts the future state of the environment given the current state and action
chamfer distance: A metric measuring the similarity between two point clouds by averaging the distances between nearest neighbors
curriculum learning: A training strategy where the difficulty or complexity of data is gradually increased; here, alternating between data collection and model training
ViT: Vision Transformer—a neural network architecture based on self-attention, applied here to patches of 3D point clouds
FPS: Farthest Point Sampling—an algorithm to select a representative subset of points from a point cloud that covers the space well