Verifiable Reasoning for LLM-based Generative Recommendation

Xinyu Lin, Hanqing Zeng, Hanchao Yu, Yinglong Xia, Jiang Zhang, Aashu Singh, Fei Liu, Wenjie Wang, Fuli Feng, Tat-Seng Chua, Qifan Wang
Meta Modern Recommendation System, National University of Singapore
arXiv (2026)
Recommendation Reasoning P13N

📝 Paper Summary

LLM-based Generative Recommendation Chain-of-Thought Reasoning
VRec introduces a reason-verify-recommend paradigm that interleaves reasoning steps with a mixture of verifiers to correct errors and prevent homogeneous reasoning in LLM-based recommendation.
Core Problem
Standard 'reason-then-recommend' approaches suffer from reasoning degradation: lacking intermediate supervision, LLMs either fall into homogeneous loops that keep repeating the same spurious correlations, or accumulate errors across autoregressive steps.
Why it matters:
  • Without verification, LLMs may shortcut reasoning to rely on surface-level correlations rather than deep user preference understanding.
  • Early missteps in the reasoning chain propagate, leading to hallucinations or irrelevant recommendations.
  • Existing methods optimize only the final recommendation token, leaving the latent reasoning process unguided and prone to degeneration.
Concrete Example: In a movie recommendation scenario, suppose a user has watched 'Titanic'. An unverified model might simply repeat 'User likes romance' at every step (homogeneous reasoning), or it might incorrectly infer 'User likes sinking ships' and recommend a documentary on shipwrecks, with the error compounding in subsequent steps.
Key Novelty
Reason-Verify-Recommend Paradigm (VRec)
  • Interleaves reasoning generation with a 'verification' step, where a Mixture of Verifiers (MoV) evaluates the reasoning embedding against specific user preference aspects (e.g., category, style).
  • Uses the verifier's prediction entropy as a 'reliability' signal; if reasoning is vague (high entropy), the system intervenes.
  • Rectifies reasoning embeddings using the verifier's internal weights (acting as preference prototypes) to guide the LLM back to a valid reasoning path.
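The three steps above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the paper's implementation: each verifier is assumed to be a linear classifier whose weight rows serve as preference prototypes, and the rectification rule (interpolating toward the most probable prototype), the entropy threshold, and all function names are hypothetical choices made for this sketch.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; higher means a vaguer prediction."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def verify(embedding, prototypes):
    """Score one reasoning embedding with one (assumed linear) verifier.

    Each row of `prototypes` is the verifier weight vector for one class
    of a preference aspect, e.g. one movie category.
    """
    logits = [sum(w * e for w, e in zip(row, embedding)) for row in prototypes]
    probs = softmax(logits)
    return probs, entropy(probs)


def rectify(embedding, prototypes, probs, step=0.5):
    """Assumed rule: interpolate toward the most probable prototype."""
    best = max(range(len(probs)), key=lambda k: probs[k])
    return [(1.0 - step) * e + step * w for e, w in zip(embedding, prototypes[best])]


def verify_and_rectify(embedding, verifiers, threshold):
    """Run every verifier in the mixture; intervene only when entropy is high."""
    for prototypes in verifiers:
        probs, h = verify(embedding, prototypes)
        if h > threshold:
            embedding = rectify(embedding, prototypes, probs)
    return embedding


def reason_verify_recommend(state, reason_step, verifiers, threshold, num_steps):
    """Interleave a caller-supplied reasoning step with verification."""
    for _ in range(num_steps):
        state = reason_step(state)
        state = verify_and_rectify(state, verifiers, threshold)
    return state
```

For example, with a single hypothetical 'category' verifier `[[4.0, 0.0], [0.0, 4.0]]` and a threshold of 0.5, the vague embedding `[0.1, 0.05]` yields a near-uniform prediction and is pulled toward the first prototype, while the already confident embedding `[2.0, 0.0]` passes through unchanged. In a real system `reason_step` would be one latent reasoning step of the LLM; here it is whatever callable the caller supplies.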
Architecture
Figure 4: The architecture of VRec (Verifiable Reasoning) and its training strategy.
Breakthrough Assessment
8/10
Proposes a structurally novel 'verify-as-you-go' mechanism for latent reasoning in recommendations, directly addressing the critical issue of reasoning hallucination/degradation.