NDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that gives more credit to correct items ranked higher in the list.
LightGCN: A graph convolutional network designed for recommendation that learns user/item embeddings from the interaction graph.
SASRec: Self-Attentive Sequential Recommendation—a model that uses attention mechanisms to capture sequential patterns in user actions.
Reranking: The process of re-ordering a preliminary list of recommendations to better align with specific user preferences.
Position-Based Feedback: A feedback mechanism that calculates the difference between an item's predicted rank and its target rank to generate corrective instructions.
User Profile Generation: The intermediate step of summarizing a user's raw interaction history into a structured textual profile (e.g., 'likes Sci-Fi').
CoT: Chain-of-Thought—a prompting technique where the model is encouraged to think step-by-step.
Batched Training: Updating the prompt based on feedback aggregated from a group of users rather than a single user, improving generalization.