CRS: Conversational Recommender System—a system that elicits user preferences and recommends items through natural language dialogue
iEvaLM: interactive Evaluation approach based on LLMs—the proposed framework using LLMs as user simulators to evaluate CRSs
Recall@k: A metric measuring the proportion of relevant items found in the top-k recommendations
ReDial: Recommendation Dialogues—a widely used dataset of human-to-human conversations about movie recommendations
OpenDialKG: A multi-domain conversational recommendation dataset (movies, books, sports, music) paired with a Knowledge Graph
User Simulator: An automated agent that mimics human user behavior to interact with and evaluate a dialogue system
Persuasiveness: A subjective metric (0-2) measuring how convincing the system's explanation for a recommendation is
Zero-shot Prompting: Providing a model with a task description and input without any specific examples
LLM: Large Language Model—a deep learning model trained on vast amounts of text data to generate human-like text