Context Bias: A specific bias where the model over-relies on static prompt text (auxiliary tokens) rather than the dynamic user history input
SFT: Supervised Fine-Tuning—training a pre-trained model on a specific dataset using standard log-likelihood maximization
Group DRO: Group Distributionally Robust Optimization—an optimization technique that minimizes the worst-case loss across predefined groups of data to ensure robust performance
Auxiliary Tokens: Fixed parts of the prompt template (e.g., 'Task: Recommend item', 'Output:') that do not carry user-specific information
Interaction Tokens: Tokens representing the user's actual historical behavior (e.g., titles of previously watched movies)
NDCG@10: Normalized Discounted Cumulative Gain at rank 10—a measure of ranking quality where higher positions are worth more
Short-cut learning: When a model solves a task by relying on spurious correlations (like simple word co-occurrence) rather than the intended reasoning path
FAA: Feature Ablation Attribution—a method to measure how much a model relies on specific input tokens by masking them and observing output changes