_comment: REQUIRED: Define ALL technical terms, acronyms, and method names used ANYWHERE in the entire summary. After drafting the summary, perform a MANDATORY POST-DRAFT SCAN: check every section individually (Core.one_sentence_thesis, evaluation_highlights, core_problem, Technical_details, Experiments.key_results notes, Figures descriptions and key_insights). HIGH-VISIBILITY RULE: Terms appearing in one_sentence_thesis, evaluation_highlights, or figure key_insights MUST be defined—these are the first things readers see. COMMONLY MISSED: PPO, DPO, MARL, dense retrieval, silver labels, cosine schedule, clipped surrogate objective, Top-k, greedy decoding, beam search, logit, ViT, CLIP, Pareto improvement, BLEU, ROUGE, perplexity, attention heads, parameter sharing, warm start, convex combination, sawtooth profile, length-normalized attention ratio, NTP. If in doubt, define it.
LLM: Large Language Model
Surrogate Model: An interpretable model trained to approximate the predictions of a complex, black-box model to provide explanations
Behavior Alignment: Training the LLM to mimic the input-output behavior of the target model using text
Intention Alignment: Training the LLM to understand the target model's internal state by projecting recommender embeddings into the LLM's token space
Hybrid Alignment: Combining text-based and embedding-based inputs to train the LLM
Two-Tower Model: A recommender architecture where users and items are encoded independently into embeddings, and their dot product determines the score
ShareGPT: A dataset of user-ChatGPT conversations used for instruction tuning to maintain general LLM capabilities
Unigram: A simple baseline that recommends items based on their global popularity frequency
SASRec: Self-Attentive Sequential Recommendation—a sequence-based recommender model used as a target black-box model in this paper
MF: Matrix Factorization—a classic collaborative filtering method used as a target black-box model
BLEU: Bilingual Evaluation Understudy—a metric for evaluating the quality of text which computes the overlap of n-grams between candidate and reference texts
ROUGE: Recall-Oriented Understudy for Gisting Evaluation—a set of metrics used to evaluate automatic summarization and machine translation
NDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that takes into account the position of relevant items
HR: Hit Ratio—the fraction of users for whom the correct item is included in the recommendation list