Abstract
Dementia can disrupt how people experience and describe events as well as their own role in them. Alzheimer's disease (AD) compromises the processing of entities expressed by nouns, while behavioral variant frontotemporal dementia (bvFTD) entails a depersonalized perspective with increased third-person references. Yet, no study has examined whether these patterns can be captured in connected speech via natural language processing tools. To tackle such gaps, we asked 96 participants (32 AD patients, 32 bvFTD patients, 32 healthy controls) to narrate a typical day of their lives and calculated the proportion of nouns, verbs, and first- or third-person markers (via part-of-speech and morphological tagging). We also extracted objective properties (frequency, phonological neighborhood, length, semantic variability) from each content word. In our main study (with 21 AD patients, 21 bvFTD patients, and 21 healthy controls), we used inferential statistics and machine learning for group-level and subject-level discrimination. The above linguistic features were correlated with patients' scores in tests of general cognitive status and executive functions. We found that, compared with HCs, (i) AD (but not bvFTD) patients produced significantly fewer nouns, (ii) bvFTD (but not AD) patients used significantly more third-person markers, and (iii) both patient groups produced more frequent words. Machine learning analyses showed that these features identified individuals with AD and bvFTD (AUC = 0.71). A generalizability test, with a model trained on the entire main study sample and tested on hold-out samples (11 AD patients, 11 bvFTD patients, 11 healthy controls), showed even better performance, with AUCs of 0.76 and 0.83 for AD and bvFTD, respectively. No linguistic feature was significantly correlated with cognitive test scores in either patient group. These results suggest that specific cognitive traits of each disorder can be captured automatically in connected speech, favoring interpretability for enhanced syndrome characterization, diagnosis, and monitoring.
Copyright: © 2024 Lopes da Cunha et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
MeSH terms
-
Aged
-
Alzheimer Disease* / diagnosis
-
Alzheimer Disease* / psychology
-
Biomarkers
-
Case-Control Studies
-
Executive Function / physiology
-
Female
-
Frontotemporal Dementia* / diagnosis
-
Frontotemporal Dementia* / psychology
-
Humans
-
Machine Learning
-
Male
-
Middle Aged
-
Natural Language Processing
-
Neuropsychological Tests
-
Speech*
Grants and funding
AG is an Atlantic Fellow at the Global Brain Health Institute (GBHI;
https://www.gbhi.org/) and is partially supported with funding from the National Institute On Aging of the National Institutes of Health (NIA-NIH;
https://www.nia.nih.gov/; R01AG075775, 2P01AG019724-21A1); Agencia Nacional de Investigación y Desarrollo (ANID;
https://anid.cl/; FONDECYT Regular 1210176, 1210195); GBHI, Alzheimer’s Association, and Alzheimer’s Society (Alzheimer’s Association GBHI ALZ UK-22-865742); the Latin American Brain Health Institute (BrainLat;
https://brainlat.uai.cl/), Universidad Adolfo Ibáñez, Santiago, Chile (#BL-SRGP2021-01); Programa Interdisciplinario de Investigación Experimental en Comunicación y Cognición (PIIECC;
https://linguisticayliteratura.usach.cl/es/piiecc), Facultad de Humanidades, Universidad de Santiago de Chile. AI is partially supported by grants ANID/FONDECYT Regular (1210195 and 1210176 and 1220995); ANID/FONDAP/15150012; ANID/PIA/ANILLOS ACT210096; ANID/FONDEF ID20I10152 and ID22I10029; Takeda (
https://www.takeda.com/) CW2680521 and the MULTI-PARTNER CONSORTIUM TO EXPAND DEMENTIA RESEARCH IN LATIN AMERICA [ReDLat;
https://red-lat.com/; supported by National Institutes of Health, National Institutes of Aging (R01 AG057234), Alzheimer’s Association (SG-20-725707), Rainwater Charitable foundation – Tau Consortium, and Global Brain Health Institute)]. AS is supported by ANID (FONDAP ID15150012); Fondecyt Regular 1231839 and PIA Anillos ACT210096) & MULTI-PARTNER CONSORTIUM TO EXPAND DEMENTIA RESEARCH IN LATIN AMERICA [ReDLat, supported by National Institutes of Health, National Institutes of Aging (R01 AG057234), Alzheimer’s Association (SG-20-725707), Rainwater Charitable foundation – Tau Consortium, and Global Brain Health Institute)]. The contents of this publication are solely the responsibility of the authors and do not represent the official views of these Institutions. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.