Flow Matching: A generative modeling technique that learns a vector field to transform a simple noise distribution into a complex data distribution (like robot actions) over a continuous time path
VLA: Vision-Language-Action models—large foundation models for robotics that take images and text as input and output robot control actions
GRPO: Group Relative Policy Optimization—an RL algorithm that estimates advantages by comparing a group of outputs from the same input, eliminating the need for a separate value network
RWFM: Reward-Weighted Flow Matching—a method that weights the flow-matching loss by the exponentiated reward of the demonstration, prioritizing high-quality data
Action Chunking: Predicting a sequence of future actions (a trajectory) at once, rather than just a single next-step action
ELBO: Evidence Lower Bound—a proxy objective used to approximate the likelihood of data in variational inference, often computationally expensive for flow models
U-Net: A neural network architecture with skip connections, commonly used to predict velocity fields in diffusion and flow-matching models
Unicycle Dynamics: A simplified vehicle model often used in robotics simulation where motion is constrained by heading and velocity