← Back to Paper List

Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation

Jian Wang, Yi Cheng, Dongding Lin, Chak Tou Leong, Wenjie Li
The Hong Kong Polytechnic University
Conference on Empirical Methods in Natural Language Processing (2023)
P13N Benchmark Agent Recommendation

📝 Paper Summary

Conversational personalization User-profile based personalization
TOPDIAL is a dataset curation framework that uses role-playing LLM agents (User, System, Moderator) to synthesize large-scale personalized target-oriented dialogues where systems proactively lead conversations while adapting to user personalities.
Core Problem
Existing target-oriented dialogue datasets either lack personalization (ignoring user profiles/personalities) or are not proactive, and manual creation of high-quality personalized datasets is prohibitively expensive.
Why it matters:
  • Without personalization, proactive systems (like recommenders) can seem obtrusive or irrelevant, damaging user experience.
  • Existing datasets are often crowd-sourced without specific target goals or simply re-purposed from non-target data, lacking the specific dynamics of 'leading' a conversation.
  • Training models to be both proactive (reaching a goal) and personalized (respecting user style) requires data that exemplifies both simultaneously.
Concrete Example: In a movie recommendation scenario, a standard system might bluntly recommend 'King of Comedy'. A personalized system, knowing the user is 'shy' and likes 'Stephen Chow', would gently bridge the topic via the actor rather than the genre. Current datasets lack these nuanced, personality-driven transitions.
Key Novelty
LLM-based Role-Playing Data Curation Framework
  • Deploys three interacting LLM agents: a User agent (simulating specific profiles/Big-5 traits), a System agent (optimizing for target achievement), and a Moderator agent (managing termination).
  • Formulates the conversation target as a <dialogue act, topic> pair (e.g., <recommend, 'The Matrix'>) rather than just keywords, ensuring actionable goals.
  • Injects explicit personality traits (Big-5) into the User agent's prompt to generate diverse, human-like resistance or acceptance behaviors.
Architecture
Architecture Figure Figure 1
The role-playing framework for automatic dataset curation involving three agents.
Evaluation Highlights
  • Alpaca-7B trained on TOPDIAL achieves 85.04% target success rate, a +36.26 point improvement over the same model trained on the seed dataset (DuRecDial 2.0).
  • Personalization F1 score improves by +14.94 points (51.99 vs 37.05) for Alpaca-7B when trained on TOPDIAL compared to the seed dataset.
  • Curated ~18K multi-turn dialogues across 4 domains (Movies, Music, Food, POIs) with an average of 12.3 utterances per dialogue.
Breakthrough Assessment
7/10
Significant contribution to data synthesis for a niche but important problem (personalized proactive dialogue). The role-playing framework is well-executed, though the core innovation is the application of LLMs to data curation rather than a new model architecture.
×