Serendipity: A recommendation quality measuring items that are both unexpected and relevant (pleasant surprise)
Proxy metrics: Algorithmic formulas (e.g., SOG, SNPR) used to estimate serendipity when human feedback is unavailable
Zero-shot: Prompting the LLM to perform a task without providing any specific examples of that task in the context
Few-shot: Prompting the LLM with a small set of example inputs and outputs to guide its generation
SOG: Serendipity-Oriented Greedy—a baseline proxy metric combining relevance, diversity, and unpopularity
SNPR: Serendipity-oriented Next POI Recommendation—a baseline metric emphasizing relevance and unexpectedness
SerenEva: The proposed meta-evaluation framework for assessing how well evaluators align with human serendipity judgments
Pearson correlation: A statistic measuring linear correlation between two sets of data (here, predicted scores vs. human ratings)
MAE: Mean Absolute Error—average of absolute differences between prediction and ground truth
RMSE: Root Mean Squared Error—measure of differences between values predicted by a model and the values observed