โ† Back to Paper List

From Lived Experience to Insight: Unpacking the Psychological Risks of Using AI Conversational Agents

Mohit Chandra, Suchismita Naik, Denae Ford, Ebele Okoli, Munmun De Choudhury, Mahsa Ershadi, Gonzalo Ramos, Javier Hernandez, Ananya Bhattacharjee, Shahed Warreth, Jina Suh
School of Interactive Computing, Georgia Institute of Technology, Purdue University, Microsoft Research, Microsoft, Microsoft AI, University of Toronto
Conference on Fairness, Accountability and Transparency (2024)
Agent Benchmark Factuality

๐Ÿ“ Paper Summary

AI Safety Human-AI Interaction Psychological Risk Assessment
This study establishes a taxonomy of psychological risks in AI conversational agents by analyzing the lived experiences of 283 individuals with mental health backgrounds, identifying specific harmful behaviors, impacts, and contexts.
Core Problem
Existing AI risk taxonomies often treat psychological risks as a minor sub-category and rely on theoretical definitions, failing to capture the nuanced, context-dependent harms experienced by vulnerable users.
Why it matters:
  • Generative AI agents are increasingly used for well-being and companionship, yet their psychological risks (e.g., attachment, validation of delusions) are under-represented in standard safety frameworks.
  • Current benchmarks focus on toxicity or bias but miss subtle behavioral harms like 'over-accommodation' or 'inappropriate content delivery' that deeply affect users with mental health conditions.
Concrete Example: Participant P141, who has a history of schizophrenia, reported hearing noises at home. The AI agent suggested the noises could be related to their past diagnosis, causing the user to distrust their own senses despite a doctor's previous mild diagnosis.
Key Novelty
Lived-Experience-Informed Psychological Risk Taxonomy
  • Constructs a risk framework based on 'extreme users' (people with lived mental health experience) rather than theoretical speculation.
  • Decomposes risk into three interacting components: AI Behaviors (e.g., manipulation), Negative Psychological Impacts (e.g., harm to identity), and User Contexts (e.g., loneliness).
  • Proposes a multi-path vignette framework to model how specific contexts exacerbate the impact of specific AI behaviors.
Architecture
Architecture Figure Figure 1
Overview of the two-phase study methodology utilized to develop the psychological risk taxonomy.
Evaluation Highlights
  • Identified 19 distinct AI behaviors and 21 negative psychological impacts from 290 collected scenarios.
  • 51.04% of surveyed participants reported that the negative interaction with the AI agent interfered with their daily activities.
  • 7.6% of participants reported that the negative psychological impact persisted for a year or more.
Breakthrough Assessment
7/10
While not an algorithmic breakthrough, it provides a critical, missing dataset and taxonomy for AI safety, shifting focus from generic 'toxicity' to nuanced psychological harm.
×