KDE: Kernel Density Estimation—a non-parametric way to estimate the probability density function of a random variable using a kernel function (like a Gaussian) centered at data points.
Wasserstein Gradient Flow: A continuous evolution of a probability distribution that follows the path of steepest descent with respect to a specific energy functional (like KL divergence) in the Wasserstein space.
Drifting Model: A generative model that evolves the generated distribution during training via a learned vector field (drifting field) to match the data distribution.
MMD: Maximum Mean Discrepancy—a statistical test and divergence measure that compares distributions by computing the distance between their mean embeddings in a kernel Hilbert space.
RKHS: Reproducing Kernel Hilbert Space—a space of functions where evaluation at a point is a continuous linear functional, allowing kernel methods to operate effectively.
vMF kernel: von Mises-Fisher kernel—a kernel function defined on the hypersphere, analogous to a Gaussian kernel in Euclidean space.
f-divergence: A family of divergence measures (including KL, Reverse KL, Chi-squared) measuring the difference between two probability distributions based on a convex function f.
identifiability: The theoretical guarantee that if the model's objective is minimized (loss is zero), the learned distribution is exactly equal to the true data distribution.