Machine learning for prediction of in-hospital mortality in coronavirus disease 2019 patients: results from an Italian multicenter study

Marika Vezzoli; Riccardo Maria Inciardi; Chiara Oriecuia; Sara Paris; Natalia Herrera Murillo; Piergiuseppe Agostoni; Pietro Ameri; Antonio Bellasi; Rita Camporotondo; Claudia Canale; Valentina Carubelli; Stefano Carugo; Francesco Catagnano; Giambattista Danzi; Laura Dalla Vecchia; Stefano Giovinazzo; Massimiliano Gnecchi; Marco Guazzi; Anita Iorio; Maria Teresa La Rovere; Sergio Leonardi; Gloria Maccagni; Massimo Mapelli; Davide Margonato; Marco Merlo; Luca Monzo; Andrea Mortara; Vincenzo Nuzzi; Matteo Pagnesi; Massimo Piepoli; Italo Porto; Andrea Pozzi; Giovanni Provenzale; Filippo Sarullo; Michele Senni; Gianfranco Sinagra; Daniela Tomasoni; Marianna Adamo; Maurizio Volterrani; Roberto Maroldi; Marco Metra; Carlo Mario Lombardi; Claudia Specchia

doi:10.2459/JCM.0000000000001329

J Cardiovasc Med (Hagerstown). 2022 Jul 1;23(7):439-446. doi: 10.2459/JCM.0000000000001329.

Authors

Marika Vezzoli¹, Riccardo Maria Inciardi², Chiara Oriecuia¹,², Sara Paris², Natalia Herrera Murillo², Piergiuseppe Agostoni³,⁴, Pietro Ameri⁵, Antonio Bellasi⁶, Rita Camporotondo⁷, Claudia Canale⁵, Valentina Carubelli², Stefano Carugo⁸, Francesco Catagnano⁷,⁹, Giambattista Danzi¹⁰, Laura Dalla Vecchia¹¹, Stefano Giovinazzo⁵, Massimiliano Gnecchi⁷, Marco Guazzi¹², Anita Iorio¹³, Maria Teresa La Rovere¹⁴, Sergio Leonardi⁷, Gloria Maccagni¹², Massimo Mapelli³,⁴, Davide Margonato⁷,⁹, Marco Merlo¹⁵, Luca Monzo¹⁶,¹⁷, Andrea Mortara⁹, Vincenzo Nuzzi¹⁵, Matteo Pagnesi², Massimo Piepoli¹⁸,¹⁹, Italo Porto⁵, Andrea Pozzi¹³, Giovanni Provenzale⁸, Filippo Sarullo²⁰, Michele Senni¹³, Gianfranco Sinagra¹⁵, Daniela Tomasoni², Marianna Adamo², Maurizio Volterrani²¹, Roberto Maroldi²², Marco Metra², Carlo Mario Lombardi², Claudia Specchia¹

Affiliations

¹ Department of Molecular and Translational Medicine, University of Brescia, Italy.
² Cardiology, ASST Spedali Civili di Brescia and Department of Medical and Surgical Specialties, Radiological Sciences and Public Health, University of Brescia, Brescia.
³ Centro Cardiologico Monzino, IRCCS, Department of Clinical Sciences and Community Health, University of Milano, Milan.
⁴ Department of Clinical Sciences and Community Health, University of Milano, Milan.
⁵ IRCCS Ospedale Policlinico San Martino and Department of Internal Medicine, University of Genova, Genova.
⁶ Innovation and Brand Reputation Unit, Papa Giovanni XXIII Hospital, Bergamo.
⁷ Fondazione IRCCS Policlinico S. Matteo and University of Pavia, Pavia.
⁸ Division of Cardiology, Ospedale San Paolo, ASST Santi Paolo E Carlo, University of Milano, Milan.
⁹ Cardiology Department, Policlinico Di Monza, Monza.
¹⁰ Division of Cardiology, Ospedale Maggiore Di Cremona, Cremona.
¹¹ Department of Cardiology, Istituti Clinici Scientifici Maugeri, IRCCS, Istituto Scientifico Di Milano, Milan.
¹² Heart Failure Unit, Cardiology Department, IRCCS San Donato Hospital, University of Milan, Milan.
¹³ Cardiovascular Department and Cardiology Unit, Papa Giovanni XXIII Hospital-Bergamo, Bergamo.
¹⁴ Department of Cardiology, Istituti Clinici Scientifici Maugeri, IRCCS, Istituto Scientifico Di Pavia, Pavia.
¹⁵ Cardiovascular Department, Azienda Sanitaria Universitaria Integrata, Trieste.
¹⁶ Istituto Clinico Casal Palocco, Rome.
¹⁷ Policlinico Casilino, Rome.
¹⁸ Heart Failure Unit, G da Saliceto Hospital, AUSL Piacenza, Piacenza.
¹⁹ Institute of Life Sciences, Sant'Anna School of Advanced Studies, Pisa.
²⁰ Cardiovascular Rehabilitation Unit, Buccheri La Ferla Fatebenefratelli Hospital, Palermo.
²¹ Department of Medical Sciences, Istituto Di Ricovero E Cura a Carattere Scientifico (IRCCS) San Raffaele Pisana.
²² Radiology, ASST Spedali Civili di Brescia and Department of Medical and Surgical Specialties, Radiological Sciences and Public Health, University of Brescia, Brescia, Italy.

PMID: 35763764
DOI: 10.2459/JCM.0000000000001329

Abstract

Background: Several risk factors that predict worse outcomes have been identified in patients with SARS-CoV-2 infection. Machine learning algorithms offer a novel approach to building a prediction model with good discriminatory capacity that can be easily used in clinical practice. The aim of this study was to obtain a risk score for in-hospital mortality in patients with coronavirus disease 2019 (COVID-19) based on a limited number of features collected at hospital admission.

Methods and results: We studied an Italian cohort of consecutive adult Caucasian patients with laboratory-confirmed COVID-19 who were hospitalized in 13 cardiology units during spring 2020. The Lasso procedure was used to select the most relevant covariates. The dataset was randomly divided into a training set containing 80% of the data, used for estimating the model, and a test set with the remaining 20%. A Random Forest modeled in-hospital mortality with the selected set of covariates; its performance was assessed with the receiver operating characteristic (ROC) curve, yielding the area under the curve (AUC), sensitivity and specificity with their 95% confidence intervals (CIs). This model was then compared with a Gradient Boosting Machine (GBM) and with logistic regression. Finally, to assess whether each model performed equally well on the training and test sets, the two AUCs were compared using DeLong's test. Among 701 patients enrolled (mean age 67.2 ± 13.2 years, 69.5% male), 165 (23.5%) died during a median hospitalization of 15 (IQR, 9-24) days. The variables selected by the Lasso procedure were age, oxygen saturation, PaO2/FiO2, creatinine clearance and elevated troponin. Compared with those who survived, deceased patients were older and had lower blood oxygenation, lower creatinine clearance and a higher prevalence of elevated troponin (all P < 0.001). The best out-of-sample performance was provided by Random Forest, with an AUC of 0.78 (95% CI: 0.68-0.88) and a sensitivity of 0.88 (95% CI: 0.58-1.00). Moreover, Random Forest was the only model that performed similarly in sample and out of sample (DeLong test P = 0.78).
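
Illustrative note (not part of the published abstract): the comparison of training-set and test-set AUCs can be sketched as follows. This is a minimal sketch, assuming the two sets are treated as independent samples, that each AUC variance is estimated with DeLong's placement-value method, and that a two-sided normal test is used; the abstract does not specify the software or test variant, so the authors' implementation may differ. All function names and the toy data are hypothetical.

```python
import math
from statistics import NormalDist


def auc_with_variance(scores, labels):
    """Return (AUC, DeLong variance estimate) for one sample.

    scores: predicted risks; labels: 1 = died, 0 = survived.
    Needs at least two positives and two negatives.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]

    def psi(x, y):
        # 1 if the positive case is ranked higher, 0.5 for a tie, else 0
        return 1.0 if x > y else 0.5 if x == y else 0.0

    # Placement values: one per positive case (v10) and per negative case (v01)
    v10 = [sum(psi(x, y) for y in neg) / len(neg) for x in pos]
    v01 = [sum(psi(x, y) for x in pos) / len(pos) for y in neg]
    auc = sum(v10) / len(v10)

    # Sample variances of the placement values around the AUC
    s10 = sum((v - auc) ** 2 for v in v10) / (len(v10) - 1)
    s01 = sum((v - auc) ** 2 for v in v01) / (len(v01) - 1)
    return auc, s10 / len(pos) + s01 / len(neg)


def compare_independent_aucs(sample_a, sample_b):
    """Two-sided z-test for a difference between two AUCs from independent samples.

    Each sample is a (scores, labels) pair. Returns (auc_a, auc_b, z, p).
    """
    auc_a, var_a = auc_with_variance(*sample_a)
    auc_b, var_b = auc_with_variance(*sample_b)
    total_var = var_a + var_b
    if total_var == 0:
        raise ValueError("variance is zero; the test statistic is undefined")
    z = (auc_a - auc_b) / math.sqrt(total_var)
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return auc_a, auc_b, z, p


# Toy data only (hypothetical predicted risks, not study data)
train = ([0.1, 0.4, 0.35, 0.8, 0.7, 0.2], [0, 0, 1, 1, 1, 0])
test = ([0.3, 0.6, 0.5, 0.9], [0, 1, 0, 1])
auc_train, auc_test, z_stat, p_value = compare_independent_aucs(train, test)
```

Under these assumptions, a large P value (such as the 0.78 reported for Random Forest) indicates no detectable difference between in-sample and out-of-sample discrimination; it does not prove the two AUCs are equal.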

Conclusion: In a large COVID-19 population, we showed that a customizable machine learning-based score derived from clinical variables is feasible and effective for the prediction of in-hospital mortality.

Publication types

Multicenter Study

MeSH terms

Aged
Aged, 80 and over
COVID-19* / diagnosis
Creatinine
Female
Hospital Mortality
Humans
Machine Learning
Male
Middle Aged
SARS-CoV-2
Troponin

Substances

Troponin
Creatinine