Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep 8;25(1):328.
doi: 10.1186/s13054-021-03749-5.

Machine-learning-based COVID-19 mortality prediction model and identification of patients at low and high risk of dying

Affiliations

Machine-learning-based COVID-19 mortality prediction model and identification of patients at low and high risk of dying

Mohammad M Banoei et al. Crit Care. .

Abstract

Background: The coronavirus disease 2019 (COVID-19) pandemic caused by the SARS-Cov2 virus has become the greatest health and controversial issue for worldwide nations. It is associated with different clinical manifestations and a high mortality rate. Predicting mortality and identifying outcome predictors are crucial for COVID patients who are critically ill. Multivariate and machine learning methods may be used for developing prediction models and reduce the complexity of clinical phenotypes.

Methods: Multivariate predictive analysis was applied to 108 out of 250 clinical features, comorbidities, and blood markers captured at the admission time from a hospitalized cohort of patients (N = 250) with COVID-19. Inspired modification of partial least square (SIMPLS)-based model was developed to predict hospital mortality. Prediction accuracy was randomly assigned to training and validation sets. Predictive partition analysis was performed to obtain cutting value for either continuous or categorical variables. Latent class analysis (LCA) was carried to cluster the patients with COVID-19 to identify low- and high-risk patients. Principal component analysis and LCA were used to find a subgroup of survivors that tends to die.

Results: SIMPLS-based model was able to predict hospital mortality in patients with COVID-19 with moderate predictive power (Q2 = 0.24) and high accuracy (AUC > 0.85) through separating non-survivors from survivors developed using training and validation sets. This model was obtained by the 18 clinical and comorbidities predictors and 3 blood biochemical markers. Coronary artery disease, diabetes, Altered Mental Status, age > 65, and dementia were the topmost differentiating mortality predictors. CRP, prothrombin, and lactate were the most differentiating biochemical markers in the mortality prediction model. Clustering analysis identified high- and low-risk patients among COVID-19 survivors.

Conclusions: An accurate COVID-19 mortality prediction model among hospitalized patients based on the clinical features and comorbidities may play a beneficial role in the clinical setting to better management of patients with COVID-19. The current study revealed the application of machine-learning-based approaches to predict hospital mortality in patients with COVID-19 and identification of most important predictors from clinical, comorbidities and blood biochemical variables as well as recognizing high- and low-risk COVID-19 survivors.

Keywords: COVID-19; Machine learning; Mortality; Prediction model; SARS-CoV-2.

PubMed Disclaimer

Conflict of interest statement

The authors declare they have no conflict of interest.

Figures

Fig. 1
Fig. 1
SIMPLS-based scatter plot shows a good separation between hospital mortality of patients with COVID-19 from survivors. The figure illustrates only the training set-based scatter plot
Fig. 2
Fig. 2
AUC for the separation of hospital mortality and survivors from COVID-19
Fig. 3
Fig. 3
Predictive partition platform analysis shows the decision tree that predicts the hospital mortality in patients with COVID-19 from survivors. Blue square: survivors, red square: hospital mortality
Fig. 4
Fig. 4
PCA plot illustrates the LCA-based clustering of patients with COVID-19. Clusters 2 and 3 are associated with a higher rate of mortality. Black circle: Survivors, red square: Hospital mortality
Fig. 5
Fig. 5
SIMPLS-based scatter plot shows a very good separation between three clusters obtained by LCA. Clusters 1 includes the patients with a lower risk of dying, and clusters 2 and 3 include patients with a higher risk of dying

Comment in

  • Demystifying machine learning for mortality prediction.
    Smit JM, van Genderen ME, Reinders MJT, Gommers DAMPJ, Krijthe JH, Van Bommel J. Smit JM, et al. Crit Care. 2021 Dec 23;25(1):447. doi: 10.1186/s13054-021-03868-z. Crit Care. 2021. PMID: 34949229 Free PMC article. No abstract available.

Similar articles

Cited by

References

    1. Dhama K, Khan S, Tiwari R, Sircar S, Bhat S, Malik YS, Singh KP, Chaicumpa W, Bonilla-Aldana DK, Rodriguez-Morales AJ. Coronavirus disease 2019-COVID-19. Clin Microbiol Rev. 2020;33(4):e00028–e120. doi: 10.1128/CMR.00028-20. - DOI - PMC - PubMed
    1. Hassan SA, Sheikh FN, Jamal S, Ezeh JK, Akhtar A. Coronavirus (COVID-19): a review of clinical features, diagnosis, and treatment. Cureus. 2020;12(3):e7355. - PMC - PubMed
    1. Guan WJ, Ni ZY, Hu Y, Liang WH, Ou CQ, He JX, Liu L, Shan H, Lei CL, Hui DSC, et al. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. 2020;382(18):1708–1720. doi: 10.1056/NEJMoa2002032. - DOI - PMC - PubMed
    1. Chen Y, Ouyang L, Bao FS, Li Q, Han L, Zhang H, Zhu B, Ge Y, Robinson P, Xu M, et al. A multimodality machine learning approach to differentiate severe and nonsevere COVID-19: model development and validation. J Med Internet Res. 2021;23(4):e23948. doi: 10.2196/23948. - DOI - PMC - PubMed
    1. Elwazir MY, Hosny S. Artificial intelligence in COVID-19 ultrastructure. J Microsc Ultrastruct. 2020;8(4):146–147. doi: 10.4103/JMAU.JMAU_28_20. - DOI - PMC - PubMed