FSQ: Finite Scalar Quantization—a method to map continuous embeddings to discrete tokens by projecting them into a hypercube and rounding, avoiding codebook collapse.
MPNet: A pre-trained language model combining Masked Language Modeling and Permuted Language Modeling, used here to encode item text.
STE: Straight-Through Estimator—a technique allowing gradients to bypass non-differentiable rounding functions during backpropagation.
beam search: A search algorithm that explores a graph by expanding the most promising nodes in a limited set, used here to decode tokens into items.
Trie: A tree data structure used to store the item catalog's token sequences, enabling efficient prefix-constrained decoding.
cold-start: A scenario where the system must recommend items or to users with little to no historical interaction data.
zero-shot: Evaluating a model on a dataset or domain it has never seen during training.
NDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that accounts for the position of relevant items in the recommendation list.
Hit@K: A metric measuring the proportion of times the ground-truth item appears in the top-K recommendations.