Predicting Hospital Readmission in Heart Failure Patients in Iran: A Comparison of Various Machine Learning Methods

Roya Najafi-Vosough; Javad Faradmal; Seyed Kianoosh Hosseini; Abbas Moghimbeigi; Hossein Mahjub

doi:10.4258/hir.2021.27.4.307

Predicting Hospital Readmission in Heart Failure Patients in Iran: A Comparison of Various Machine Learning Methods

Healthc Inform Res. 2021 Oct;27(4):307-314. doi: 10.4258/hir.2021.27.4.307. Epub 2021 Oct 31.

Authors

Roya Najafi-Vosough¹, Javad Faradmal^{1

2}, Seyed Kianoosh Hosseini³, Abbas Moghimbeigi^{4

5}, Hossein Mahjub^{1

6}

Affiliations

¹ Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran.
² Modeling of Noncommunicable Diseases Research Center, Hamadan University of Medical Sciences, Hamadan, Iran.
³ Department of Cardiology, School of Medicine, Hamadan University of Medical Sciences, Hamadan, Iran.
⁴ Department of Biostatistics and Epidemiology, Faculty of Health, Alborz University of Medical Sciences, Karaj, Iran.
⁵ Research Center for Health, Safety and Environment, Alborz University of Medical Sciences, Karaj, Iran.
⁶ Research Center for Health Sciences, Hamadan University of Medical Sciences, Hamadan, Iran.

Abstract

Objectives: Heart failure (HF) is a common disease with a high hospital readmission rate. This study considered class imbalance and missing data, which are two common issues in medical data. The current study's main goal was to compare the performance of six machine learning (ML) methods for predicting hospital readmission in HF patients.

Methods: In this retrospective cohort study, information of 1,856 HF patients was analyzed. These patients were hospitalized in Farshchian Heart Center in Hamadan Province in Western Iran, from October 2015 to July 2019. The support vector machine (SVM), least-square SVM (LS-SVM), bagging, random forest (RF), AdaBoost, and naïve Bayes (NB) methods were used to predict hospital readmission. These methods' performance was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, and accuracy. Two imputation methods were also used to deal with missing data.

Results: Of the 1,856 HF patients, 29.9% had at least one hospital readmission. Among the ML methods, LS-SVM performed the worst, with accuracy in the range of 0.57-0.60, while RF performed the best, with the highest accuracy (range, 0.90-0.91). Other ML methods showed relatively good performance, with accuracy exceeding 0.84 in the test datasets. Furthermore, the performance of the SVM and LS-SVM methods in terms of accuracy was higher with the multiple imputation method than with the median imputation method.

Conclusions: This study showed that RF performed better, in terms of accuracy, than other methods for predicting hospital readmission in HF patients.

Keywords: Classification; Data Analysis; Heart Failure; Machine Learning; Patient Readmission.

Grants and funding

Hamadan University of Medical Sciences