← Back to Paper List

On the steerability of large language models toward data-driven personas

Junyi Li, Ninareh Mehrabi, Charith Peris, Palash Goyal, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta
University of Maryland, College Park, Amazon
arXiv (2023)
P13N Recommendation Benchmark

📝 Paper Summary

User modeling Steerable generation
The paper steers LLMs toward specific viewpoints by learning data-driven 'personas' via collaborative filtering of opinion data, rather than relying on broad demographic descriptions.
Core Problem
Existing methods steer LLMs using broad demographic traits (e.g., age, party), which fails to capture the nuanced, latent social groups and diverse opinions present within populations.
Why it matters:
  • LLMs naturally under-represent certain groups (e.g., ages 65+, Mormons) due to randomized viewpoints acquired during fine-tuning
  • Demographic labels are insufficient proxies for actual belief systems; individuals with the same demographics often hold different personas
  • Current steering methods lack the expressiveness to align models with the complex, multi-dimensional nature of human opinion
Concrete Example: Members of 'Cluster-0' (mostly Republicans) believe expanding benefits won't reduce inequality and gun access doesn't cause violence, whereas the general population strongly disagrees (75% and 46% respectively). A demographic-only prompt might miss this specific correlation of beliefs.
Key Novelty
Collaborative Filtering for Persona Steering
  • Uses Matrix Factorization to embed survey respondents into a continuous vector space based on their actual answers, creating 'individual personas'
  • Clusters these embeddings to discover latent 'cluster personas' that group individuals by shared beliefs rather than just demographics
  • Employs a Soft-Prompting Model (SPM) to map these persona embeddings into virtual tokens that steer the frozen LLM
Evaluation Highlights
  • Achieves between 57%-77% improvement in prediction accuracy over best-performing baselines (Demographics and Context-based prompting) across selected LLMs
  • Identifies latent social clusters with distinct belief systems that cross-cut traditional demographic lines (e.g., clusters mixing different education levels but sharing immigration views)
  • Demonstrates that 88.05% of the identified 'Cluster-0' persona distrusts the Democratic party, compared to only 18.21% of the general population, showing strong viewpoint capture
Breakthrough Assessment
7/10
A strong methodological shift from explicit demographic prompting to latent embedding-based steering. The use of collaborative filtering for LLM personalization is a novel and effective application.
×