Shilling Attack: An attack where adversaries inject fake user profiles with synthetic interactions into a recommender system to manipulate item rankings
GRPO: Group Relative Policy Optimization—a reinforcement learning algorithm that optimizes a policy by normalizing rewards within a sampled group of outputs, used here to fine-tune the LLM
PCA: Principal Component Analysis—a dimensionality reduction technique used here to compute user similarity in a lower-dimensional space
Unpopular-Item Ratio: The proportion of items in a user's history that belong to the lowest popularity percentile; high values are a heuristic for detecting attackers who select obscure filler items
RFT: Reinforcement Fine-Tuning—fine-tuning a model using reinforcement learning signals (rewards) rather than just supervised labels
NDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that accounts for the position of relevant items in a recommendation list
Hit Ratio (HR): The percentage of users for whom the target item appears in the top-N recommendations
CoT: Chain-of-Thought—a prompting technique where the model generates intermediate reasoning steps before the final answer