RecPIE: Recommendation with Prediction-Informed Explanations—the proposed framework joint optimizing explanations and predictions.
PPO: Proximal Policy Optimization—an RL algorithm used here to fine-tune the LLM based on recommendation accuracy rewards.
LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning method for LLMs.
POI: Point of Interest—a specific location (e.g., restaurant, park) in geospatial recommendation tasks.
discriminative model: A model that classifies or ranks inputs (e.g., predicting which item is 'correct'), as opposed to generating new data.
generative model: A model that creates new data instances (e.g., text explanations), like an LLM.
hallucination: When an LLM generates plausible but factually incorrect or ungrounded information.
Recall@k: A metric measuring the proportion of relevant items found in the top-k recommendations.