Background: Machine learning (ML) employs algorithms that learn from data, building models with the potential to predict events by aggregating a large number of variables and assessing their complex interactions. The aim of this study is to assess ML potential in identifying patients with ischemic heart disease (IHD) at high risk of cardiac death (CD).
Methods: 3987 (mean age 68 ± 11) hospitalized IHD patients were enrolled. We implemented and compared various ML models and their combination into ensembles. Model output constitutes a new ML indicator to be employed for stratification. Primary variable importance was assessed with ablation tests.
Results: An ensemble classifier combining three ML models achieved the best performance to predict CD (AUROC of 0.830, F1-macro of 0.726). ML indicator use through Cox survival analysis outperformed the 18 variables individually, producing a better stratification compared to standard multivariate analysis (improvement of ∼20%). Patients in the low risk group defined through ML indicator had a significantly higher survival (88.8% versus 29.1%). The main variables identified were Dyslipidemia, LVEF, Previous CABG, Diabetes, Previous Myocardial Infarction, Smoke, Documented resting or exertional ischemia, with an AUROC of 0.791 and an F1-score of 0.674, lower than that of 18 variables. Both code and clinical data are freely available with this article.
Conclusion: ML may allow a faster, low-cost and reliable evaluation of IHD patient prognosis by inclusion of more predictors and identification of those more significant, improving outcome prediction towards the development of precision medicine in this clinical field.
Keywords: Artificial intelligence; Ischemic heart disease; Machine learning; Prognosis; Survival analysis.
Copyright © 2024 The Authors. Published by Elsevier B.V. All rights reserved.