Semantic IDs (S-IDs): Discrete tokens representing an item (ad) derived from its semantic content (text) rather than an arbitrary integer ID
RQ-VAE: Residual-Quantized Variational AutoEncoder—a model that compresses high-dimensional embeddings into a sequence of discrete codes (tokens) used as Semantic IDs
DPO: Direct Preference Optimization—a method to align language models with human/business preferences without a separate reward model
Trie-Tree: A prefix tree data structure used here to constrain the LLM's generation, ensuring it only produces valid sequences of Semantic IDs corresponding to real ads
GMV: Gross Merchandise Value, the total value of merchandise sold over a given period through an online marketplace or e-commerce platform
HitRatio: A metric measuring the proportion of test cases in which the relevant item appears among the top-K recommended items
NDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that accounts for the position of relevant items
TensorRT: A high-performance deep learning inference optimizer and runtime library developed by NVIDIA
information cocoon: A situation where users are exposed only to information that reinforces their existing views or interests, limiting diversity
constrained decoding: Forcing an LLM to generate tokens only from a valid set of options (e.g., valid ad IDs) rather than open-ended text
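
To make the Trie-Tree and constrained decoding entries concrete, here is a minimal sketch of how a prefix tree can restrict generation to valid Semantic ID sequences. It is an illustration under stated assumptions, not the system's actual implementation: the class `SemanticIdTrie`, the function `greedy_constrained_decode`, the toy catalog, and the scoring function `toy_scores` (a stand-in for an LLM's next-token scores) are all hypothetical names introduced here.

```python
from typing import Callable, Dict, List, Sequence


class SemanticIdTrie:
    """Prefix tree over the Semantic ID sequences of real items (hypothetical helper)."""

    def __init__(self) -> None:
        self.root: Dict[int, dict] = {}

    def insert(self, codes: Sequence[int]) -> None:
        """Register one valid Semantic ID sequence."""
        node = self.root
        for code in codes:
            node = node.setdefault(code, {})

    def allowed_next(self, prefix: Sequence[int]) -> List[int]:
        """Return the tokens that can legally follow `prefix` (empty if prefix is invalid)."""
        node = self.root
        for code in prefix:
            if code not in node:
                return []
            node = node[code]
        return sorted(node)


def greedy_constrained_decode(
    trie: SemanticIdTrie,
    score_fn: Callable[[List[int]], Sequence[float]],
    length: int,
) -> List[int]:
    """Greedy decoding that only considers tokens the trie allows at each step."""
    prefix: List[int] = []
    for _ in range(length):
        allowed = trie.allowed_next(prefix)
        if not allowed:
            break
        scores = score_fn(prefix)  # one score per vocabulary token
        prefix.append(max(allowed, key=lambda tok: scores[tok]))
    return prefix


# Toy catalog: three ads, each identified by a 3-token Semantic ID (assumed values).
trie = SemanticIdTrie()
for semantic_id in [(1, 4, 2), (1, 4, 7), (3, 0, 5)]:
    trie.insert(semantic_id)


def toy_scores(prefix: List[int]) -> List[float]:
    """Stand-in for model scores over a vocabulary of 8 tokens; prefers larger token ids."""
    return [0.1 * tok for tok in range(8)]


decoded = greedy_constrained_decode(trie, toy_scores, length=3)
print(decoded)  # [3, 0, 5]
```

Without the constraint, the toy scorer would pick token 7 at the first step, which does not start any real item's Semantic ID. With the trie, every decoded sequence is guaranteed to correspond to an item in the catalog. A production system would typically apply the same idea by masking the model's logits at each step and would use beam search rather than greedy selection.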