Machine Learning and Natural Language Processing in Mental Health: Systematic Review

Aziliz Le Glaz; Yannis Haralambous; Deok-Hee Kim-Dufor; Philippe Lenca; Romain Billot; Taylor C Ryan; Jonathan Marsh; Jordan DeVylder; Michel Walter; Sofian Berrouiguet; Christophe Lemey

doi:10.2196/15708

J Med Internet Res. 2021 May 4;23(5):e15708. doi: 10.2196/15708.

Authors

Aziliz Le Glaz¹, Yannis Haralambous², Deok-Hee Kim-Dufor¹, Philippe Lenca², Romain Billot², Taylor C Ryan³, Jonathan Marsh⁴, Jordan DeVylder⁴, Michel Walter¹,⁵, Sofian Berrouiguet¹,²,⁵,⁶, Christophe Lemey^#¹,²,⁵

Affiliations

¹ URCI Mental Health Department, Brest Medical University Hospital, Brest, France.
² IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238, Brest, France.
³ Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States.
⁴ Fordham University Graduate School of Social Service, New York, NY, United States.
⁵ EA 7479 SPURBO, Université de Bretagne Occidentale, Brest, France.
⁶ LaTIM, INSERM, UMR 1101, Brest, France.

^# Contributed equally.

PMID: 33944788
PMCID: PMC8132982
DOI: 10.2196/15708

Abstract

Background: Machine learning is a branch of artificial intelligence in which systems automatically learn models from data to make better decisions. Natural language processing (NLP), by using corpora and learning approaches, performs well in statistical tasks such as text classification and sentiment mining.
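To make "learning a model from a corpus for text classification" concrete, here is a minimal sketch of a multinomial Naive Bayes classifier with add-one smoothing in plain Python. It is illustrative only: the toy sentences, the labels, and the function names (tokenize, train, predict) are assumptions made for this example and are not taken from any of the reviewed studies.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(examples):
    """Fit a multinomial Naive Bayes model from (text, label) pairs."""
    word_counts = defaultdict(Counter)  # label -> token frequencies
    label_counts = Counter()            # label -> number of documents
    for text, label in examples:
        label_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocab


def predict(model, text):
    """Return the label with the highest log posterior for `text`."""
    word_counts, label_counts, vocab = model
    n_docs = sum(label_counts.values())
    scores = {}
    for label in label_counts:
        total = sum(word_counts[label].values())
        score = math.log(label_counts[label] / n_docs)  # log prior
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # skip words never seen during training
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][tok] + 1) / (total + len(vocab))
            )
        scores[label] = score
    return max(scores, key=scores.get)


# Invented toy corpus (hypothetical, for illustration only).
toy_corpus = [
    ("i feel hopeless and tired all the time", "negative"),
    ("nothing helps and i cannot sleep", "negative"),
    ("i enjoyed a calm walk with friends today", "positive"),
    ("feeling rested and hopeful this morning", "positive"),
]

model = train(toy_corpus)
label = predict(model, "so tired and hopeless")
print(label)
```

A model of this kind is transparent in the sense that each word's contribution to the decision can be read directly from the counts, at the cost of ignoring word order and context.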

Objective: The primary aim of this systematic review was to summarize and characterize, in methodological and technical terms, studies that used machine learning and NLP techniques for mental health. The secondary aim was to consider the potential use of these methods in mental health clinical practice.

Methods: This systematic review follows the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines and is registered with PROSPERO (International Prospective Register of Systematic Reviews; number CRD42019107376). The search was conducted using 4 medical databases (PubMed, Scopus, ScienceDirect, and PsycINFO) with the following keywords: machine learning, data mining, psychiatry, mental health, and mental disorder. The exclusion criteria were as follows: languages other than English, anonymization process, case studies, conference papers, and reviews. No limitations on publication dates were imposed.

Results: A total of 327 articles were identified, of which 269 (82.3%) were excluded and 58 (17.7%) were included in the review. The results were organized from a qualitative perspective. Although the studies had heterogeneous topics and methods, some themes emerged. Study populations could be grouped into 3 categories: patients included in medical databases, patients who came to the emergency room, and social media users. The main objectives were to extract symptoms, classify severity of illness, compare therapy effectiveness, provide psychopathological clues, and challenge the current nosography. Medical records and social media were the 2 major data sources. With regard to the methods used, preprocessing relied on standard NLP methods and on unique identifier extraction dedicated to medical texts. Efficient classifiers were preferred over classifiers whose functioning is transparent. Python was the most frequently used platform.
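The screening proportions reported in the Results can be rechecked with a few lines of Python. This is only an arithmetic sketch; the variable and function names are illustrative and do not come from the review.

```python
# Counts as reported in the Results paragraph.
identified = 327
excluded = 269
included = 58

# The two groups should partition the identified articles.
assert excluded + included == identified


def pct(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`, rounded to one decimal."""
    return round(100 * part / whole, 1)


excluded_pct = pct(excluded, identified)
included_pct = pct(included, identified)
print(f"excluded: {excluded_pct}%  included: {included_pct}%")
```

Both values agree with the figures stated in the abstract (82.3% and 17.7%).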

Conclusions: Machine learning and NLP models have been highly topical issues in medicine in recent years and may be considered a new paradigm in medical research. However, these processes tend to confirm clinical hypotheses rather than develop entirely new information, and one major category of the population (ie, social media users) is an imprecise cohort. Moreover, some language-specific features can improve the performance of NLP methods, and their extension to other languages should be more closely investigated. Nevertheless, machine learning and NLP techniques provide useful information from unexplored data (ie, patients' daily habits that are usually inaccessible to care providers). Before these techniques are considered as an additional tool for mental health care, the remaining ethical issues should be discussed in a timely manner. Machine learning and NLP methods may offer multiple perspectives in mental health research but should also be considered as tools to support clinical practice.

Keywords: artificial intelligence; data mining; machine learning; mental health; natural language processing; psychiatry.

©Aziliz Le Glaz, Yannis Haralambous, Deok-Hee Kim-Dufor, Philippe Lenca, Romain Billot, Taylor C Ryan, Jonathan Marsh, Jordan DeVylder, Michel Walter, Sofian Berrouiguet, Christophe Lemey. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 04.05.2021.

Publication types

Meta-Analysis
Research Support, Non-U.S. Gov't
Review
Systematic Review

MeSH terms

Artificial Intelligence*
Data Management
Humans
Machine Learning
Mental Health
Natural Language Processing*