Semantic ID: A sequence of discrete tokens (integers) that represents an item, derived from quantizing its content features (e.g., text embeddings).
Product Quantization (PQ): A quantization technique that splits a vector into sub-vectors and quantizes each separately, resulting in a tuple of discrete codes.
Autoregressive Generation: Generating a sequence one token at a time, where each token depends on the previous ones.
Multi-token Prediction (MTP): A training objective where the model predicts multiple future tokens simultaneously and independently, rather than sequentially.
NDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that accounts for the position of relevant items in the recommendation list.
Beam Search: A search algorithm that explores a graph by expanding the most promising nodes in a limited set (beam).
Graph-Constrained Decoding: A decoding method proposed in this paper that restricts the search space to neighbors in a pre-computed graph of similar semantic IDs.