Generative Retrieval: A recommendation paradigm that uses autoregressive models to generate item identifiers directly, rather than ranking existing embeddings
Semantic IDs: Discrete tokens representing items, derived from hierarchical quantization (RQ-VAE) of item embeddings, preserving semantic similarity
RQ-VAE: Residual Quantized Variational AutoEncoder—a method to compress high-dimensional vectors into discrete codebook indices (Semantic IDs)
TIGER: Transformer Index for Generative Recommenders—a state-of-the-art generative retrieval model used as the backbone for Mender
Preference Discerning: The proposed paradigm of explicitly conditioning recommendation models on user preferences expressed in natural language
Steerability: The ability to guide a recommendation model towards or away from specific items using external signals (here, text preferences)
Sentence-T5: A pre-trained encoder model used to generate embeddings for sentences, used here to create semantic IDs for items
Hit Rate (HR@k): A metric measuring the proportion of test cases where the ground truth item is present in the top-k predicted items
NDCG: Normalized Discounted Cumulative Gain—a ranking metric that accounts for the position of relevant items in the recommendation list