Machine learning for subtype definition and risk prediction in heart failure, acute coronary syndromes and atrial fibrillation: systematic review of validity and clinical utility

Amitava Banerjee; Suliang Chen; Ghazaleh Fatemifar; Mohamad Zeina; R Thomas Lumbers; Johanna Mielke; Simrat Gill; Dipak Kotecha; Daniel F Freitag; Spiros Denaxas; Harry Hemingway

doi:10.1186/s12916-021-01940-7

Machine learning for subtype definition and risk prediction in heart failure, acute coronary syndromes and atrial fibrillation: systematic review of validity and clinical utility

BMC Med. 2021 Apr 6;19(1):85. doi: 10.1186/s12916-021-01940-7.

Authors

Amitava Banerjee^{1

2

3

4}, Suliang Chen^{5

6}, Ghazaleh Fatemifar^{5

6}, Mohamad Zeina⁷, R Thomas Lumbers^{5

6

8}, Johanna Mielke⁹, Simrat Gill¹⁰, Dipak Kotecha^{10

11}, Daniel F Freitag⁹, Spiros Denaxas^{5

6

12}, Harry Hemingway^{5

6

13}

Affiliations

¹ Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK. ami.banerjee@ucl.ac.uk.
² Health Data Research UK, University College London, London, UK. ami.banerjee@ucl.ac.uk.
³ University College London Hospitals NHS Trust, 235 Euston Road, London, UK. ami.banerjee@ucl.ac.uk.
⁴ Barts Health NHS Trust, The Royal London Hospital, Whitechapel Rd, London, UK. ami.banerjee@ucl.ac.uk.
⁵ Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK.
⁶ Health Data Research UK, University College London, London, UK.
⁷ Medical School, King's College London, London, UK.
⁸ University College London Hospitals NHS Trust, 235 Euston Road, London, UK.
⁹ Bayer AG, Division Pharmaceuticals, Open Innovation & Digital Technologies, Wuppertal, Germany.
¹⁰ University of Birmingham Institute of Cardiovascular Sciences and University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK.
¹¹ Department of Cardiology, University Medical Centre Utrecht, Utrecht, the Netherlands.
¹² The Alan Turing Institute, London, UK.
¹³ University College London Hospitals Biomedical Research Centre (UCLH BRC), London, UK.

Abstract

Background: Machine learning (ML) is increasingly used in research for subtype definition and risk prediction, particularly in cardiovascular diseases. No existing ML models are routinely used for cardiovascular disease management, and their phase of clinical utility is unknown, partly due to a lack of clear criteria. We evaluated ML for subtype definition and risk prediction in heart failure (HF), acute coronary syndromes (ACS) and atrial fibrillation (AF).

Methods: For ML studies of subtype definition and risk prediction, we conducted a systematic review in HF, ACS and AF, using PubMed, MEDLINE and Web of Science from January 2000 until December 2019. By adapting published criteria for diagnostic and prognostic studies, we developed a seven-domain, ML-specific checklist.

Results: Of 5918 studies identified, 97 were included. Across studies for subtype definition (n = 40) and risk prediction (n = 57), there was variation in data source, population size (median 606 and median 6769), clinical setting (outpatient, inpatient, different departments), number of covariates (median 19 and median 48) and ML methods. All studies were single disease, most were North American (n = 61/97) and only 14 studies combined definition and risk prediction. Subtype definition and risk prediction studies respectively had limitations in development (e.g. 15.0% and 78.9% of studies related to patient benefit; 15.0% and 15.8% had low patient selection bias), validation (12.5% and 5.3% externally validated) and impact (32.5% and 91.2% improved outcome prediction; no effectiveness or cost-effectiveness evaluations).

Conclusions: Studies of ML in HF, ACS and AF are limited by number and type of included covariates, ML methods, population size, country, clinical setting and focus on single diseases, not overlap or multimorbidity. Clinical utility and implementation rely on improvements in development, validation and impact, facilitated by simple checklists. We provide clear steps prior to safe implementation of machine learning in clinical practice for cardiovascular diseases and other disease areas.

Keywords: Cardiovascular disease; Informatics; Machine learning; Risk prediction; Subtype; Systematic review.

Publication types

Research Support, Non-U.S. Gov't
Systematic Review

MeSH terms

Acute Coronary Syndrome* / diagnosis
Acute Coronary Syndrome* / epidemiology
Atrial Fibrillation* / diagnosis
Atrial Fibrillation* / epidemiology
Cost-Benefit Analysis
Heart Failure* / diagnosis
Heart Failure* / epidemiology
Humans
Machine Learning

Abstract

Publication types

MeSH terms

Grants and funding