DIRE: Discriminative Recommendation Framework—using open-source LLMs as trainable content encoders to extract embeddings
GENRE: Generative Recommendation Framework—using closed-source LLMs to generate synthetic data (summaries, profiles) to augment training
PLM-NR: Pretrained Language Model for News Recommendation—a baseline method using BERT-sized models as encoders
LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning technique that freezes main weights and trains small decomposition matrices
Chain-based Generation: Iterative prompting where outputs from one step (e.g., user profile generation) are used as inputs for the next (e.g., synthetic content generation)
Natural Concator: A strategy of concatenating multi-field text data using natural language templates (e.g., 'news article: <title>...') rather than special separation tokens
MIND: Microsoft News Recommendation Dataset—a large-scale benchmark for news recommendation
Goodreads: A book recommendation dataset used for evaluating content-based filtering performance
warm user: A user with more than 5 interactions in their history
cold user: A user with 5 or fewer interactions in their history
nDCG: Normalized Discounted Cumulative Gain—a ranking metric that values correct items appearing earlier in the list