SFT: Supervised Fine-Tuning—retraining a pre-trained model on a smaller, task-specific dataset to adapt its behavior
HR@k: Hit Ratio at k—the proportion of test cases where the target item is present in the top-k recommendations
NDCG@k: Normalized Discounted Cumulative Gain at k—a ranking metric that accounts for the position of relevant items in the list
RMSE: Root Mean Square Error—a standard metric for rating prediction measuring the average magnitude of error
ROUGE: Recall-Oriented Understudy for Gisting Evaluation—a set of metrics used to evaluate automatic summarization and machine translation
P-tuning: A parameter-efficient fine-tuning method that optimizes continuous prompt embeddings rather than all model parameters
LoRA: Low-Rank Adaptation—a fine-tuning technique that injects trainable low-rank decomposition matrices into pre-trained weights
Matrix Factorization (MF): A traditional recommendation technique that decomposes the user-item interaction matrix into lower-dimensional user and item latent factors
Chain-of-Thought (CoT): A prompting technique that encourages the model to generate intermediate reasoning steps