SMPL-X: A parametric 3D body model that represents body shape, pose, and facial expressions using a set of low-dimensional parameters
FLAME: A parametric 3D head model specifically designed for facial expressions and head pose
VQ-VAE: Vector Quantized Variational Autoencoder—a generative model that learns a discrete codebook of latent representations to compress high-dimensional data
Masked Modeling: A training technique where parts of the input are hidden (masked), and the model learns to predict them, forcing it to learn robust contextual features
FGD: Fréchet Gesture Distance—a metric measuring the distribution distance between generated and real gestures (lower is better)
LBS: Linear Blend Skinning—a technique to deform a 3D mesh based on skeletal bone transformations
MoSh++: Motion Shaper—a method to estimate SMPL body parameters from sparse motion capture markers