Development and comparison of machine learning-based models for predicting heart failure after acute myocardial infarction

BMC Med Inform Decis Mak. 2023 Aug 24;23(1):165. doi: 10.1186/s12911-023-02240-1.

Abstract

Aims: Heart failure (HF) is one of the common adverse cardiovascular events after acute myocardial infarction (AMI), but the predictive efficacy of numerous machine learning (ML) built models is unclear. This study aimed to build an optimal model to predict the occurrence of HF in AMI patients by comparing seven ML algorithms.

Methods: Cohort 1 included AMI patients from 2018 to 2019 divided into HF and control groups. All first routine test data of the study subjects were collected as the features to be selected for the model, and seven ML algorithms with screenable features were evaluated. Cohort 2 contains AMI patients from 2020 to 2021 to establish an early warning model with external validation. ROC curve and DCA curve to analyze the diagnostic efficacy and clinical benefit of the model respectively.

Results: The best performer among the seven ML algorithms was XgBoost, and the features of XgBoost algorithm for troponin I, triglycerides, urine red blood cell count, γ-glutamyl transpeptidase, glucose, urine specific gravity, prothrombin time, prealbumin, and urea were ranked high in importance. The AUC of the HF-Lab9 prediction model built by the XgBoost algorithm was 0.966 and had good clinical benefits.

Conclusions: This study screened the optimal ML algorithm as XgBoost and developed the model HF-Lab9 will improve the accuracy of clinicians in assessing the occurrence of HF after AMI and provide a reference for the selection of subsequent model-building algorithms.

Keywords: Acute myocardial infarction; Extreme gradient boosting; Heart failure; Machine learning; Model.

MeSH terms

  • Algorithms
  • Heart Failure* / diagnosis
  • Heart Failure* / etiology
  • Humans
  • Machine Learning
  • Myocardial Infarction* / complications
  • Myocardial Infarction* / diagnosis
  • ROC Curve