A machine learning framework supporting prospective clinical decisions applied to risk prediction in oncology

Lorinda Coombs; Abigail Orlando; Xiaoliang Wang; Pooja Shaw; Alexander S Rich; Shreyas Lakhtakia; Karen Titchener; Blythe Adamson; Rebecca A Miksad; Kathi Mooney

doi:10.1038/s41746-022-00660-3

NPJ Digit Med. 2022 Aug 16;5(1):117. doi: 10.1038/s41746-022-00660-3.

Affiliations

¹ Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA.
² University of North Carolina-Chapel Hill, Lineberger Cancer Institute, Chapel Hill, NC, USA.
³ Flatiron Health, Inc, New York, NY, USA.
⁴ Flatiron Health, Inc, New York, NY, USA. ramiksad@flatiron.com.

Abstract

We present a general framework for developing a machine learning (ML) tool that supports clinician assessment of patient risk using electronic health record-derived real-world data. We apply the framework to a quality improvement use case in an oncology setting: identifying patients at risk for a near-term (60-day) emergency department (ED) visit who could potentially be eligible for a home-based acute care program. Framework steps include defining clinical quality improvement goals, model development and validation, bias assessment, retrospective and prospective validation, and deployment in clinical workflow. In the retrospective analysis for the use case, 8% of patient encounters were associated with a high risk (pre-defined as predicted probability ≥20%) for a near-term ED visit by the patient. Positive predictive value (PPV) and negative predictive value (NPV) for future ED events were 26% and 91%, respectively. Odds ratio (OR) of ED visit (high- vs. low-risk) was 3.5 (95% CI: 3.4-3.5). The model appeared to be calibrated across racial, gender, and ethnic groups. In the prospective analysis, 10% of patients were classified as high risk, 76% of whom were confirmed by clinicians as eligible for home-based acute care. PPV and NPV for future ED events were 22% and 95%, respectively. OR of ED visit (high- vs. low-risk) was 5.4 (95% CI: 2.6-11.0). The proposed framework for an ML-based tool that supports clinician assessment of patient risk is a stepwise development approach; we successfully applied the framework to an ED visit risk prediction use case.
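
For readers less familiar with the reported metrics, the following minimal sketch shows how PPV, NPV, and an odds ratio with a 95% confidence interval can be derived from a 2x2 table of risk class versus observed ED visit. It is an illustration only and not the authors' code. The function names and the counts are hypothetical and are not study data. The confidence interval uses the log-based (Woolf) method, which is an assumption, since the abstract does not state how the intervals were computed. Only the 20% risk threshold comes from the abstract.

```python
import math


def classify_high_risk(predicted_probability, threshold=0.20):
    """Label an encounter high risk when the predicted probability meets the
    threshold (the abstract pre-defines high risk as probability >= 20%)."""
    return predicted_probability >= threshold


def two_by_two_metrics(tp, fp, fn, tn):
    """Compute PPV, NPV, and the odds ratio with a 95% CI from 2x2 counts.

    tp: high risk, ED visit observed      fp: high risk, no ED visit
    fn: low risk, ED visit observed       tn: low risk, no ED visit
    All four counts must be non-zero for the odds ratio to be defined.
    """
    ppv = tp / (tp + fp)  # share of high-risk cases that had an ED visit
    npv = tn / (tn + fn)  # share of low-risk cases that had no ED visit
    odds_ratio = (tp * tn) / (fp * fn)  # odds of ED visit, high vs. low risk

    # Woolf method (an assumption): normal approximation on the log odds ratio.
    se_log_or = math.sqrt(1 / tp + 1 / fp + 1 / fn + 1 / tn)
    z = 1.96  # approximate two-sided 95% normal quantile
    ci_low = math.exp(math.log(odds_ratio) - z * se_log_or)
    ci_high = math.exp(math.log(odds_ratio) + z * se_log_or)

    return {
        "ppv": ppv,
        "npv": npv,
        "odds_ratio": odds_ratio,
        "ci_low": ci_low,
        "ci_high": ci_high,
    }


# Hypothetical counts chosen for illustration only; not taken from the study.
metrics = two_by_two_metrics(tp=30, fp=70, fn=90, tn=810)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With these invented counts, 100 of 1,000 encounters are flagged as high risk. In a setting like the one described, a high NPV alongside a modest PPV is the expected pattern when the predicted event is relatively uncommon.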

Grants and funding

T32NR013456/U.S. Department of Health & Human Services | NIH | National Institute of Nursing Research (NINR)