← Back to Paper List

Personalization of industrial human–robot communication through domain adaptation based on user feedback

Debasmita Mukherjee, Jayden Hong, Haripriya Vats, Sooyeon Bae, H. Najjaran
University of British Columbia, University of Victoria, Indira Gandhi Delhi Technical University for Women, University of Toronto
User modeling and user-adapted interaction (2024)
P13N MM Speech

📝 Paper Summary

Human-Robot Collaboration (HRC) Robot Perception Personalized Machine Learning
PF-HRCom personalizes generic facial expression recognition models for specific industrial users by leveraging voice command feedback to auto-label small batches of user-specific image data.
Core Problem
Generic perception models trained on standard public datasets fail to generalize to specific industrial users and dynamic environments due to domain shifts (lighting, background, idiosyncratic expressions).
Why it matters:
  • Standard datasets (like KDEF) contain posed, exaggerated expressions that differ significantly from natural, subtle human behaviors in real-world industrial settings
  • Manually collecting and labeling large, personalized datasets for every new worker or environment change is labor-intensive and impractical
  • Inaccurate emotion recognition in safety-critical tasks can lead to dangerous failures if a robot misses cues that a human is distracted or confused
Concrete Example: A generic model trained on clean lab data might misclassify a worker's 'focused' expression as 'angry' due to harsh factory lighting or a cluttered background. The proposed system asks the user 'Are you engaged?' via voice, uses the 'Yes/No' answer to auto-label their current face image, and retrains itself.
Key Novelty
Personalization through Feedback-enabled Human-Robot Communication (PF-HRCom)
  • Uses a robust, high-accuracy modality (voice commands) to provide ground truth labels for a noisier, harder-to-label modality (facial expressions) in real-time
  • Employs iterative transfer learning on very small batches of user data mixed with the original generic dataset to adapt to the specific user without catastrophic forgetting
Evaluation Highlights
  • +19.6% accuracy improvement on cluttered user-specific data (DS2) after adapting a generic KDEF-trained model using the PF-HRCom framework
  • Achieves 0.76 F1-score on cluttered user data significantly faster (fewer training iterations) by mixing small batches of user data with the original dataset
  • Eliminates the need for manual annotation by successfully using voice feedback to auto-label user images during the collaboration task
Breakthrough Assessment
5/10
A practical, application-specific framework for industrial safety. While the core ML technique (transfer learning) is standard, the cross-modal feedback loop for auto-labeling in an industrial context is a useful system-level contribution.
×