LLM-ARS: LLM-based Agentic Recommender Systems—systems where LLMs act as autonomous agents to plan and execute recommendations.
ID-based features: Traditional recommendation inputs representing users and items as unique numerical identifiers (embeddings), which lack semantic richness.
MDP: Markov Decision Process—a mathematical framework for modeling decision-making where outcomes are partly random and partly under the control of a decision maker.
Implicit feedback: User signals inferred from behavior (e.g., clicks, watch time) rather than explicit ratings, often noisy and hard to interpret.
ReAct: Reasoning + Acting—a paradigm where LLMs generate reasoning traces before executing actions, allowing for dynamic adjustment.
MLLM: Multimodal Large Language Model—an LLM capable of processing and generating multiple data types (text, images, audio).
SFT: Supervised Fine-Tuning—training a model on labeled examples to adapt it to a specific task.