← Back to Paper List

Retrieval-Augmented Conversational Recommendation with Prompt-based Semi-Structured Natural Language State Tracking

Sara Kemper, Justin Cui, Kai Dicarlantonio, Kathy Lin, Danjie Tang, Anton Korikov, Scott Sanner
University of Waterloo, University of Toronto
arXiv (2024)
RAG Recommendation P13N QA

📝 Paper Summary

Conversational Recommendation (ConvRec) Retrieval-Augmented Generation (RAG)
RA-Rec combines LLM-driven semi-structured dialogue state tracking with review-based retrieval to map complex natural language user preferences to items without relying on rigid metadata schemas.
Core Problem
Traditional conversational recommendation relies on mapping user intents to rigid, often incomplete metadata taxonomies, causing systems to fail when users express complex preferences (e.g., 'classy joint') that don't match predefined fields.
Why it matters:
  • User metadata is frequently out-of-date or sparse, leading to poor recommendation quality despite relevant information existing in unstructured reviews
  • Users naturally express preferences in indirect ways ('I'm watching my weight') that standard slot-filling dialogue systems cannot capture or reason about
  • Bridging the gap between expressive user language and item databases requires commonsense reasoning that keyword-based or metadata-based systems lack
Concrete Example: A user states 'I’m watching my weight.' A traditional system fails because there is no 'diet' metadata field. RA-Rec captures this as a natural language value in the state, retrieves reviews mentioning 'low-cal veggie options,' and correctly identifies a relevant restaurant.
Key Novelty
Semi-Structured Natural Language Dialogue State
  • Replaces rigid slot-filling with a JSON state where keys are fixed (for structure) but values are LLM-generated natural language (for nuance), enabling the capture of complex constraints like 'laid-back vibe'
  • Integrates 'Reviewed Item Retrieval' (Late Fusion) into the dialogue loop, scoring items based on how well their individual reviews match the natural language dialogue state rather than matching item metadata
Architecture
Architecture Figure Figure 1
The high-level architecture of the RA-Rec system pipeline, detailing the flow from user utterance to final system response.
Breakthrough Assessment
5/10
A solid system demonstration applying LLMs and RAG to conversational recommendation. While the semi-structured state concept is practical, the paper is a system demo without comparative quantitative benchmarks.
×