โ† Back to Paper List

One Persona, Many Cues, Different Results: How Sociodemographic Cues Impact LLM Personalization

Franziska Weeber, Vera Neplenbroek, Jan Batzner, Sebastian Padรณ
University of Stuttgart, University of Amsterdam, Weizenbaum Institute, MCML, TUM
arXiv (2026)
P13N Benchmark

๐Ÿ“ Paper Summary

LLM Personalization Bias and Fairness Prompt Robustness
Different methods of signaling a user's sociodemographic identity to an LLM (names, explicit mentions, conversation history) yield significantly different personalization biases, suggesting reliance on single cues is methodologically flawed.
Core Problem
Prior research typically evaluates LLM personalization bias using a single persona cue (e.g., just a name or just a system prompt statement), ignoring how sensitive models are to prompt variations.
Why it matters:
  • Reliance on single cues threatens external validity: findings might be artifacts of the specific prompt rather than true model behavior
  • High-stakes domains like health and legal advice show disparate outcomes based on persona, but the extent of this bias varies by how the persona is introduced
  • Some common cues (explicit mentions) are unnatural in real-world usage, potentially overestimating bias compared to natural cues (conversation history)
Concrete Example: When a user asks 'Should I go to the emergency room?', Gemma-3-27B answers 'No' if prompted with 'The user is female', but 'Yes' if prompted with a conversation history typical of a female user. This inconsistency means bias audits using only one method could be misleading.
Key Novelty
Systematic Multi-Cue Personalization Evaluation
  • Compare six different persona cues (names, explicit mentions, conversation histories) across three sociodemographic variables (gender, race, age) to measure consistency
  • Introduce the concept of 'external validity' for persona cues, distinguishing between artificial explicit prompts and natural implicit identity markers like conversation history
Evaluation Highlights
  • Explicit mentions in user prompts cause significant disparities across personas in 20/24 experimental combinations, vastly more than names in system prompts (1/24)
  • High correlation between cues (ฯ > 0.9) masks significant distributional differences: on medical advice tasks, explicit cues trigger different decisions than natural conversation histories
  • Non-binary personas face significant disparities in 6/8 tasks (e.g., lower accuracy on AITA verdict prediction), often receiving more liberal or cautious advice than male/female personas
Breakthrough Assessment
7/10
Strong methodological critique that invalidates single-cue bias studies. Provides actionable recommendations for future personalization research, though the technical novelty is in the rigorous comparison rather than a new model architecture.
×