Memorization Coverage: The percentage of items, users, or interactions in a dataset that an LLM can reproduce exactly when prompted with their identifiers
MovieLens-1M: A classic recommender systems dataset containing 1 million ratings from 6,000 users on 4,000 movies
Popularity Bias: The tendency of a model to recommend or memorize items that appear frequently in the data, often ignoring niche or 'long-tail' items
HR@1: Hit Rate at 1βa metric measuring whether the single top-recommended item is the correct ground-truth item
nDCG: Normalized Discounted Cumulative Gainβa ranking metric that accounts for the position of relevant items in the recommendation list
Data Leakage: When test data is inadvertently included in the training set, allowing the model to 'cheat' by memorizing answers rather than generalizing
Zero-shot prompting: Asking the model to perform a task without providing any examples in the prompt
Few-shot prompting: Providing a few examples of the task (e.g., input-output pairs) in the prompt to guide the model