Post-hoc explanation: Generating explanations after a recommendation has already been made by a separate model, rather than generating the recommendation and explanation simultaneously
Instruction Tuning: Fine-tuning a pre-trained language model on a dataset of (instruction, input, output) triples to improve its ability to follow specific tasks
LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning technique that injects trainable low-rank matrices into frozen model layers to reduce memory usage
Chain of Thought (CoT): A prompting technique where the model is encouraged to produce intermediate reasoning steps before the final answer
Discriminator: In this context, an LLM fine-tuned to judge and rank the quality of two different explanations for the same item
Embedded methods: Recommendation approaches where the explanation generation is tightly integrated into the model architecture (e.g., using attention weights)
PEPLER: A baseline explainable recommendation model that generates text explanations based on user and item IDs