Flow Matching: A generative modeling technique that learns a velocity field to transform a simple prior distribution (noise) into a complex data distribution.
GRPO: Group Relative Policy Optimization—an RL algorithm that estimates advantages by normalizing rewards within a group of outputs generated from the same prompt, removing the need for a value function critic.
Trajectory Branching: A sampling method where a generation path splits at a specific timestep; one branch continues deterministically, while others inject noise, isolating the impact of that specific step.
ODE: Ordinary Differential Equation—a deterministic process used for sampling in flow models.
SDE: Stochastic Differential Equation—a probabilistic process involving noise injection, used here for exploration during training.
Geneval: A benchmark for evaluating compositional capabilities of text-to-image models (e.g., object counting, spatial relationships).
PickScore: A metric and reward model trained to predict human preferences for generated images.