Abstract

Since 2020, the COVID-19 pandemic has changed healthcare behaviors. Mandatory mask wearing has affected how doctors and patients perceive their interactions, so building a satisfying relationship can no longer rely on empathizing through facial expressions alone. The voice has become more important in overcoming the barrier that masks impose. Hence, verbal and non-verbal communication are crucial criteria for doctor-patient interaction during medical consultations and other conversations. In recent years, speech emotion recognition has been a popular research domain. Despite abundant prior work, nonverbal emotion recognition in medical scenarios remains underexplored. In this study, we investigate YAMNet transfer learning on a Mandarin Chinese speech corpus, the NTHU-NTUA Chinese Interactive Emotion Corpus (NNIME), and use real-world dermatology clinic recordings to test its generalization capability. The validation accuracy on NNIME data was 0.59 for activation prediction and 0.57 for valence. On the doctor-patient dataset, the validation accuracy was 0.24 for activation and 0.58 for valence.

Original language: English
Title of host publication: MEDINFO 2023 - The Future is Accessible
Subtitle of host publication: Proceedings of the 19th World Congress on Medical and Health Informatics
Editors: Jen Bichel-Findlay, Paula Otero, Philip Scott, Elaine Huesing
Publisher: IOS Press BV
Pages: 1121-1125
Number of pages: 5
ISBN (Electronic): 9781643684567
Publication status: Published - Jan 25 2024
Event: 19th World Congress on Medical and Health Informatics, MedInfo 2023 - Sydney, Australia
Duration: Jul 8 2023 to Jul 12 2023

Publication series

Name: Studies in Health Technology and Informatics
Volume: 310
ISSN (Print): 0926-9630
ISSN (Electronic): 1879-8365

Conference

Conference: 19th World Congress on Medical and Health Informatics, MedInfo 2023
Country/Territory: Australia
City: Sydney
Period: 7/8/23 to 7/12/23

Keywords

  • bidirectional long short-term memory networks
  • doctor-patient communication
  • medical education
  • Speech emotion recognition
  • YAMNet transfer learning

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics
  • Health Information Management

Fingerprint

Dive into the research topics of 'Speech Emotion Recognition Applied to Real-World Medical Consultation'. Together they form a unique fingerprint.
