STN: Spatial Transformer Network—a learnable module that actively transforms (e.g., rotates, scales) feature maps within a neural network to correct for spatial variations
InfoNCE: A contrastive loss function used to pull positive pairs (similar data) close and push negative pairs apart in representation space
Curriculum Randomization: A training strategy where the intensity of domain randomization (noise, visual changes) is gradually increased over time to stabilize learning
Sim2Real: Transferring a policy trained in a physics simulation to a physical robot in the real world
Digital Twin: A virtual simulation environment designed to match the real-world setup as closely as possible
SRM: Sample Randomization Method—a data augmentation technique
MV-MWM: Multi-View Masked World Models—a baseline method using masked autoencoders for visual representation
RGB-D: Red, Green, Blue, plus Depth—an image format containing color and distance information