The effect of machine learning regression algorithms and sample size on individualized behavioral prediction with functional connectivity features
- PMID: 29870817
- DOI: 10.1016/j.neuroimage.2018.06.001
The effect of machine learning regression algorithms and sample size on individualized behavioral prediction with functional connectivity features
Abstract
Individualized behavioral/cognitive prediction using machine learning (ML) regression approaches is becoming increasingly applied. The specific ML regression algorithm and sample size are two key factors that non-trivially influence prediction accuracies. However, the effects of the ML regression algorithm and sample size on individualized behavioral/cognitive prediction performance have not been comprehensively assessed. To address this issue, the present study included six commonly used ML regression algorithms: ordinary least squares (OLS) regression, least absolute shrinkage and selection operator (LASSO) regression, ridge regression, elastic-net regression, linear support vector regression (LSVR), and relevance vector regression (RVR), to perform specific behavioral/cognitive predictions based on different sample sizes. Specifically, the publicly available resting-state functional MRI (rs-fMRI) dataset from the Human Connectome Project (HCP) was used, and whole-brain resting-state functional connectivity (rsFC) or rsFC strength (rsFCS) were extracted as prediction features. Twenty-five sample sizes (ranged from 20 to 700) were studied by sub-sampling from the entire HCP cohort. The analyses showed that rsFC-based LASSO regression performed remarkably worse than the other algorithms, and rsFCS-based OLS regression performed markedly worse than the other algorithms. Regardless of the algorithm and feature type, both the prediction accuracy and its stability exponentially increased with increasing sample size. The specific patterns of the observed algorithm and sample size effects were well replicated in the prediction using re-testing fMRI data, data processed by different imaging preprocessing schemes, and different behavioral/cognitive scores, thus indicating excellent robustness/generalization of the effects. The current findings provide critical insight into how the selected ML regression algorithm and sample size influence individualized predictions of behavior/cognition and offer important guidance for choosing the ML regression algorithm or sample size in relevant investigations.
Keywords: Functional magnetic resonance imaging (MRI); Individualized prediction; Machine learning; Regression algorithm; Resting-state functional connectivity; Sample size.
Copyright © 2018 Elsevier Inc. All rights reserved.
Similar articles
-
Deep neural networks and kernel regression achieve comparable accuracies for functional connectivity prediction of behavior and demographics.Neuroimage. 2020 Feb 1;206:116276. doi: 10.1016/j.neuroimage.2019.116276. Epub 2019 Oct 11. Neuroimage. 2020. PMID: 31610298 Free PMC article.
-
Connectome-based predictive modeling of attention: Comparing different functional connectivity features and prediction methods across datasets.Neuroimage. 2018 Feb 15;167:11-22. doi: 10.1016/j.neuroimage.2017.11.010. Epub 2017 Nov 6. Neuroimage. 2018. PMID: 29122720 Free PMC article.
-
Bootstrapping promotes the RSFC-behavior associations: An application of individual cognitive traits prediction.Hum Brain Mapp. 2020 Jun 15;41(9):2302-2316. doi: 10.1002/hbm.24947. Epub 2020 Mar 16. Hum Brain Mapp. 2020. PMID: 32173976 Free PMC article.
-
Machine learning in resting-state fMRI analysis.Magn Reson Imaging. 2019 Dec;64:101-121. doi: 10.1016/j.mri.2019.05.031. Epub 2019 Jun 5. Magn Reson Imaging. 2019. PMID: 31173849 Free PMC article. Review.
-
Ten simple rules for predictive modeling of individual differences in neuroimaging.Neuroimage. 2019 Jun;193:35-45. doi: 10.1016/j.neuroimage.2019.02.057. Epub 2019 Mar 1. Neuroimage. 2019. PMID: 30831310 Free PMC article. Review.
Cited by
-
Structural connectome architecture shapes the maturation of cortical morphology from childhood to adolescence.Nat Commun. 2024 Jan 26;15(1):784. doi: 10.1038/s41467-024-44863-6. Nat Commun. 2024. PMID: 38278807 Free PMC article.
-
Development of an interpretable machine learning-based intelligent system of exercise prescription for cardio-oncology preventive care: A study protocol.Front Cardiovasc Med. 2023 Dec 1;9:1091885. doi: 10.3389/fcvm.2022.1091885. eCollection 2022. Front Cardiovasc Med. 2023. PMID: 38106819 Free PMC article.
-
Multilayer meta-matching: translating phenotypic prediction models from multiple datasets to small data.bioRxiv [Preprint]. 2023 Dec 7:2023.12.05.569848. doi: 10.1101/2023.12.05.569848. bioRxiv. 2023. PMID: 38106085 Free PMC article. Preprint.
-
Sensitivity Evaluation of Enveloped and Non-enveloped Viruses to Ethanol Using Machine Learning: A Systematic Review.Food Environ Virol. 2023 Dec 5. doi: 10.1007/s12560-023-09571-2. Online ahead of print. Food Environ Virol. 2023. PMID: 38049702 Review.
-
Using Artificial Intelligence to Identify the Associations of Children's Performance of Coloring, Origami, and Copying Activities With Visual-Motor Integration.Am J Occup Ther. 2023 Sep 1;77(5):7705205080. doi: 10.5014/ajot.2023.050210. Am J Occup Ther. 2023. PMID: 37824724 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
