VQ-VAE: Vector Quantized Variational AutoEncoder—a model that learns discrete representations by snapping continuous latent vectors to the nearest entry in a learned codebook
Codebook: A matrix of learnable vectors (centroids) used to discretize continuous data; input vectors are replaced by the index of the nearest codebook vector
STE: Straight-Through Estimator—a trick to allow backpropagation through non-differentiable discrete operations by copying gradients unchanged
RQ: Residual Quantization—a method that quantizes a vector iteratively, where each step quantizes the residual error of the previous step
PQ: Product Quantization—a method that splits a high-dimensional vector into sub-vectors and quantizes each independently to reduce codebook size
LFQ: Lookup-Free Quantization—methods that effectively discard the explicit codebook, mapping latent dimensions directly to binary or scalar values
FSQ: Finite Scalar Quantization—a method projecting latents to dimensions rounded to a small set of integers, forming an implicit codebook
Semantic IDs: Discrete tokens representing user preferences or item attributes in recommender systems, enabling LLMs to process recommendation tasks
Q-Former: A transformer module that uses learnable query vectors to extract fixed-length features from variable-length inputs