Does machine learning improve prediction of VA primary care reliance?

Edwin S Wong; Linnaea Schuttner; Ashok Reddy

doi:10.37765/ajmc.2020.42144

Am J Manag Care. 2020 Jan;26(1):40-44. doi: 10.37765/ajmc.2020.42144.

Authors

Edwin S Wong¹, Linnaea Schuttner, Ashok Reddy

Affiliation

¹ Center for Veteran-Centered and Value-Driven Care, VA Puget Sound Health Care System, 1660 S Columbian Way, HSR&D MS-152, Seattle, WA 98108. Email: edwin.wong@va.gov.

PMID: 31951358

Abstract

Objectives: The Veterans Affairs (VA) Health Care System is among the largest integrated health systems in the United States. Many VA enrollees are dual users of Medicare, and little research has examined methods to most accurately predict which veterans will be mostly reliant on VA services in the future. This study examined whether machine learning methods can better predict future reliance on VA primary care compared with traditional statistical methods.

Study design: Observational study of 83,143 VA patients dually enrolled in fee-for-service Medicare using VA and Medicare administrative databases and the 2012 Survey of Healthcare Experiences of Patients.

Methods: The primary outcome was a dichotomous measure denoting whether patients obtained more than 50% of all primary care visits (VA + Medicare) from VA. We compared the performance of 6 candidate models (logistic regression, elastic net regression, decision trees, random forest, gradient boosting machine, and neural network) in predicting 2013 reliance as a function of 61 patient characteristics observed in 2012. We measured performance using the cross-validated area under the receiver operating characteristic curve (AUROC).
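
As an illustration of the two quantities named in the Methods, the following minimal Python sketch computes the dichotomous reliance outcome and an AUROC from predicted scores. It is not the authors' code: the function names, the handling of patients with no primary care visits, and the toy data are assumptions made for this example, and the AUROC uses the standard pairwise-ranking definition rather than any particular library.

```python
from typing import Sequence


def mostly_va_reliant(va_visits: int, medicare_visits: int) -> bool:
    """Return True when more than 50% of all primary care visits came from VA.

    Assumption: a patient with zero total visits has no defined reliance,
    so an error is raised. The abstract does not say how such cases were handled.
    """
    total = va_visits + medicare_visits
    if total == 0:
        raise ValueError("reliance is undefined with no primary care visits")
    return va_visits / total > 0.5


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case.
    Tied scores count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = mostly VA reliant in the outcome year.
toy_labels = [1, 1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.2]
toy_auroc = auroc(toy_labels, toy_scores)  # 5 of 6 positive-negative pairs ranked correctly
```

In a cross-validated setting, a function like `auroc` would be applied to the held-out fold in each split and the per-fold values averaged. The pairwise loop is quadratic in sample size, so it suits small illustrations rather than a cohort of 83,143 patients.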

Results: Overall, 72.9% and 74.5% of veterans were mostly VA reliant in 2012 and 2013, respectively. All models had similar average AUROCs, ranging from 0.873 to 0.892. The best-performing model was the gradient boosting machine, which exhibited a modestly higher AUROC and similar variance compared with standard logistic regression.

Conclusions: The modest gains in performance from the best-performing model, gradient boosting machine, are unlikely to outweigh inherent drawbacks, including computational complexity and limited interpretability compared with traditional logistic regression.

Publication types

Observational Study
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Aged
Aged, 80 and over
Female
Forecasting / methods
Humans
Logistic Models
Machine Learning*
Male
Medicare
Middle Aged
Patient Acceptance of Health Care / statistics & numerical data*
Primary Health Care / statistics & numerical data*
United States
United States Department of Veterans Affairs
Veterans Health Services / trends*