Vector Quantization (VQ): A compression technique where vectors of weights are mapped to the nearest entry in a fixed-size codebook, replacing the vector with an index.
Codebook: A table of representative vectors (codewords) used to reconstruct the original weights.
Assignment: The index pointing to a specific codeword in the codebook for a given weight sub-vector.
DiT: Diffusion Transformer—a class of diffusion models that replaces the U-Net backbone with a Transformer architecture.
Zero-data Calibration: A calibration method that uses synthetic data (e.g., Gaussian noise) instead of real images to tune quantization parameters.
FID: Fréchet Inception Distance—a metric for evaluating the quality of generated images by comparing the distribution of generated images to real images; lower is better.
sFID: Spatial Fréchet Inception Distance—a variant of FID that captures spatial relationships better.
IS: Inception Score—a metric measuring the diversity and clarity of generated images; higher is better.