SFT: Supervised Fine-Tuning—retraining a pre-trained model on a specific task dataset
Collaborative Filtering: Recommendation approach that predicts user preference based on the patterns of other similar users
NCF: Neural Collaborative Filtering—a deep learning framework that replaces the inner product of matrix factorization with a neural architecture
LightGCN: Light Graph Convolutional Network—a simplified GCN for recommendation that learns embeddings by propagating them on the user-item interaction graph
BDLM: Bridge Domain-specific and LLM models—the authors' proposed framework
Deep Mutual Learning: A training strategy where two models learn collaboratively by teaching each other, often by penalizing differences in their internal representations
Task-specific tokens: Special tokens added to the LLM vocabulary (e.g., <uid1>) to represent specific users or items, distinct from natural language words
HR@K: Hit Rate at K—the proportion of test cases where the target item appears in the top K recommendations
Zero-3 strategy: A memory optimization technique for training large models (from DeepSpeed/ZeRO) that partitions optimizer states across GPUs