I2I recommendation: Item-to-Item recommendation—recommending items similar to a specific target item the user is currently viewing
GCL: Generative-Contrastive Learning—a training approach combining contrastive loss (distinguishing related items) and generative loss (producing text/hashtags)
CSFT: Collaborative Supervised Fine-tuning—fine-tuning the model to generate hashtags/categories, termed 'Collaborative' because it enhances the embedding used for recommendation
Note Compression Prompt: A specific prompt template designed to instruct the LLM to compress input text into a single special token ([EMB]) before generating output
co-occurrence mechanism: A method to define 'related' notes based on user behavior: if users frequently view Note A then click Note B, they are considered related
virtual token: A special token (e.g., [EMB]) inserted into the sequence whose hidden state is treated as the dense vector representation of the entire input
AUC: Area Under the ROC Curve—a metric measuring the ability of a classifier to distinguish between positive and negative samples
Recall@k: The proportion of relevant items found in the top-k recommendations