← Back to Paper List

Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

Chengkai Wang, Baisong Liu
Ningbo University
arXiv (2026)
Recommendation KG Factuality P13N

📝 Paper Summary

Explainable Recommendation LLM-based Recommendation
PURE improves explainable recommendation by selecting reasoning paths that are not only factually correct but also strictly aligned with the user's specific historical preferences, preventing valid but unconvincing justifications.
Core Problem
Existing explainable recommenders assume factual correctness equals trustworthiness, but often generate 'preference-inconsistent' explanations—justifying recommendations with factually true attributes that the user historically dislikes or ignores.
Why it matters:
  • Factually correct but irrelevant explanations (e.g., praising a horror movie's 'jump scares' to a user who hates them) erode user trust.
  • Standard faithfulness metrics only check if attributes exist in the item, failing to detect when the reasoning contradicts user preferences.
  • Current retrieval methods favor high-frequency, generic concepts (hubs) that lack personalization.
Concrete Example: A system recommends a prison drama to a user who likes heartwarming comedies. It explains the recommendation by highlighting 'realistic suffering'—a factually correct attribute of the movie, but exactly what the user avoids. The user feels misunderstood despite the accurate facts.
Key Novelty
Preference-aligned Unhallucinated Reasoning for Explanation (PURE)
  • Intervenes at the retrieval stage (select-then-generate) rather than just generation, filtering evidence to ensure it aligns with user intent before the LLM sees it.
  • Uses a target-aware intent mechanism that dynamically prioritizes user history relevant to the current recommendation, rather than using a static user profile.
  • Introduces a multi-view specificity metric to prune generic 'hub' nodes, prioritizing specific, information-rich paths over popular but vague connections.
Evaluation Highlights
  • Reduces preference inconsistency by significant margins (quantitative metrics imply improvement, though specific percentage deltas are not in snippet) on three real-world datasets compared to baselines.
  • Consistently reduces factual hallucinations while maintaining competitive recommendation accuracy and explanation quality.
  • Introduces new feature-level metrics to quantify preference inconsistency, revealing misalignments that standard factuality metrics fail to detect.
Breakthrough Assessment
7/10
Identifies a subtle but critical failure mode (preference inconsistency) overlooked by standard factuality research. The solution is methodologically sound (graph pruning + prompting), though primarily an integration of existing graph/LLM techniques.
×