← Back to Paper List

Faithful Path Language Modelling for Explainable Recommendation over Knowledge Graph

G Balloccu, L Boratto, C Cancedda, G Fenu, M Marras
University of Cagliari, Polytechnic University of Turin
arXiv, 10/2023 (2023)
Recommendation KG Factuality Reasoning P13N

📝 Paper Summary

Explainable Recommendation Knowledge Graph Reasoning Path Language Modeling
PEARLM eliminates hallucinated explanations in recommender systems by enforcing knowledge graph constraints during decoding and learning token embeddings directly from paths rather than pre-trained embeddings.
Core Problem
Existing path-based language models for recommendation often generate 'hallucinated' paths—sequences of entities and relations that do not actually exist in the Knowledge Graph (KG)—leading to unfaithful explanations.
Why it matters:
  • Hallucinated paths (e.g., inventing a 'starred in' relation between a user and a movie) erode user trust and fail GDPR 'right to explanation' requirements.
  • Current models rely on pre-trained KG embeddings that are optimized for link prediction, not path generation, limiting recommendation accuracy.
  • Users struggle to detect subtle inaccuracies in explanations, meaning systems must guarantee structural faithfulness by design.
Concrete Example: A model might explain a recommendation for 'Interstellar' by claiming the user watched 'Movie A' which starred 'Johnny Depp' who starred in 'Interstellar', even if Johnny Depp is not in Interstellar within the KG. Existing models like PLM generate such non-existent triplets 94% of the time at hop 3.
Key Novelty
Path-based Explainable-Accurate Recommender based on Language Modelling (PEARLM)
  • Treats KG paths as sentences and trains a causal language model (Transformer) to generate them, predicting the next entity/relation token.
  • Introduces Knowledge Graph Constraint Decoding (KGCD) to force the model to only select tokens that are valid neighbors in the KG, guaranteeing zero hallucinations.
  • Learns embeddings directly from path tokens from scratch (Direct Embedding Learning) rather than initializing with pre-trained KG embeddings (like TransE), capturing richer path-centric semantics.
Architecture
Architecture Figure Figure 2
The PEARLM framework pipeline: (a) Path Extraction from KG, (b) Pretraining Causal Language Model, (c) KG-Constrained Decoding.
Evaluation Highlights
  • +42% to +78% improvement in NDCG over best baselines (KGAT, CKE) on MovieLens1M and LastFM datasets.
  • Achieves 100% Path Faithfulness Rate (PFR), completely eliminating corrupted paths, whereas the PLM baseline drops to ~6-10% faithfulness at hop 3.
  • Outperforms state-of-the-art in Coverage by up to 73% on LastFM while maintaining high Serendipity and Novelty.
Breakthrough Assessment
8/10
Significantly advances explainable recommendation by solving the hallucination problem (structural unfaithfulness) while simultaneously delivering massive gains in recommendation utility.
×