Re-examining the robustness of voice features in predicting depression: Compared with baseline of confounders

PLoS One. 2019 Jun 20;14(6):e0218172. doi: 10.1371/journal.pone.0218172. eCollection 2019.

Abstract

A large proportion of patients with depressive disorder do not receive an effective diagnosis, which makes a more objective assessment necessary to facilitate more rapid and accurate diagnosis of depression. Speech data are easy to acquire clinically, and their association with depression has been studied, although the actual predictive effect of voice features has not been examined. We therefore lack a general understanding of the extent to which voice features contribute to the identification of depression. In this study, we investigated the significance of the association between voice features and depression using binary logistic regression, and re-examined the actual classification effect of voice features on depression through classification modeling. Nearly 1,000 Chinese females participated in this study. Several different datasets were included as test sets. We found that 4 voice features (PC1, PC6, PC17, PC24; P < 0.05, corrected) made significant contributions to depression, and that the contribution effect of the voice features alone reached 35.65% (Nagelkerke's R²). In classification modeling, the voice-data-based model had consistently higher prediction accuracy (F-measure) than the baseline model of demographic data when tested on different datasets, even across different emotion contexts. The F-measure of voice features alone reached 81%, consistent with existing data. These results demonstrate that voice features are effective in predicting depression and indicate that more sophisticated models based on voice features can be built to help in clinical diagnosis.
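To make the two analysis steps in the abstract concrete, the sketch below shows, under assumptions, how a binary logistic regression with Nagelkerke's R² as the effect-size measure and a classifier scored by F-measure might be computed. This is not the authors' code: the synthetic data, the 24-feature setup standing in for the voice principal components (PC1, PC6, ...), and the scikit-learn pipeline are all illustrative placeholders.

```python
# Minimal sketch (not the study's code) of the abstract's two analyses:
# (1) binary logistic regression with Nagelkerke's R-squared as effect size,
# (2) a classifier evaluated by F-measure on a held-out set.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score, log_loss
from sklearn.model_selection import train_test_split

# Synthetic placeholder standing in for voice principal components.
X, y = make_classification(n_samples=1000, n_features=24, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Nagelkerke's R^2: a rescaled Cox-Snell R^2, computed from the
# log-likelihoods of the fitted model and an intercept-only null model.
n = len(y_train)
ll_full = -n * log_loss(y_train, model.predict_proba(X_train))
ll_null = -n * log_loss(y_train, np.full(n, y_train.mean()))
r2_cox_snell = 1.0 - np.exp((2.0 / n) * (ll_null - ll_full))
r2_nagelkerke = r2_cox_snell / (1.0 - np.exp((2.0 / n) * ll_null))
print(f"Nagelkerke's R^2: {r2_nagelkerke:.4f}")

# F-measure of the classifier, as in the abstract's classification step.
print(f"F-measure: {f1_score(y_test, model.predict(X_test)):.2f}")
```

In this framing, Nagelkerke's R² quantifies how much the voice features improve model fit over a null model (the abstract's 35.65%), while the F-measure (the abstract's 81%) scores out-of-sample classification, which is why the paper reports both.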

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • China
  • Confounding Factors, Epidemiologic
  • Datasets as Topic
  • Depression / physiopathology*
  • Female
  • Humans
  • Middle Aged
  • Models, Theoretical
  • Voice*

Grants and funding

The authors gratefully acknowledge the generous support from the National Basic Research Program of China (2014CB744600). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.