Generative Retrieval: A search paradigm where a model generates the identifier of a document/item directly, rather than matching a query vector against a database index
Collaborative Filtering: A recommendation technique that predicts user preference based on patterns of items co-occurring in user history (e.g., playlists)
Semantic IDs: Item identifiers constructed such that similar items have similar IDs (e.g., sharing prefixes), allowing the model to learn relationships between items
Sparse Coding: A representation learning method where data is approximated as a sparse linear combination of a small set of basis vectors (dictionary)
Hits@k: A metric counting the proportion of queries for which at least one relevant item appears in the top-k retrieved results
MRR: Mean Reciprocal Rank—a metric that evaluates the ranking quality by averaging the reciprocal of the rank of the first relevant item
Diversified Beam Search: A decoding algorithm that penalizes generating similar sequences to ensure the set of recommended items covers different aspects of the request