HLLM: Hierarchical Large Language Model—the proposed two-tier architecture separating item feature extraction from user interest modeling.
Item LLM: The first tier of HLLM that converts raw item text descriptions into dense vector embeddings using a special [ITEM] token.
User LLM: The second tier of HLLM that takes a sequence of item embeddings (from Item LLM) and predicts the embedding of the next item of interest.
InfoNCE loss: A contrastive loss function used to maximize the similarity between the predicted embedding and the ground-truth item embedding while minimizing similarity with negative samples.
Cold-start: A scenario where the system must recommend items or serve users with little to no prior interaction history.
Scaling law: The observation that model performance improves primarily as a power-law function of model size, dataset size, and compute.
SASRec: Self-Attentive Sequential Recommendation—a baseline model that uses a Transformer encoder to model user sequential behavior based on Item IDs.
ID-based models: Traditional recommendation models that represent users and items as unique numerical IDs mapped to learned embeddings.
Generative recommendation: A paradigm where the model generates the representation (or ID/text) of the next item directly.
Discriminative recommendation: A paradigm where the model scores a specific user-item pair to predict the likelihood of interaction (e.g., click/no-click).