Predicting High-Cost Patients at Point of Admission Using Network Science

IEEE J Biomed Health Inform. 2018 Nov;22(6):1970-1977. doi: 10.1109/JBHI.2017.2783049. Epub 2017 Dec 13.


Data mining models for high-cost patient encounter prediction at the point-of-admission (HPEPP) in inpatient wards are scarce in the literature. This is due to the lack of availability of relevant features at such an early stage of treatment. In this study, we create a disease co-occurrence network (DCN) using a subset of the state inpatient database of Arizona. We explore this network for community formation and structural properties to create new input features for HPEPP models. Tree-based data mining models are trained using input feature sets including these new network features, and distinct disease communities in the DCN are identified. We propose community membership and high-cost propensity scores as two network-based features for HPEPP modeling. We compare the performance of models with different input feature sets and find that the new features significantly improve the accuracy sensitivity and specificity of prediction models. This model has the potential to improve targeted care management and reduce health care expenditure.

MeSH terms

  • Adolescent
  • Adult
  • Child
  • Child, Preschool
  • Chronic Disease
  • Electronic Health Records
  • Female
  • Health Care Costs / statistics & numerical data*
  • Hospitalization / economics*
  • Hospitalization / statistics & numerical data*
  • Humans
  • Infant
  • Infant, Newborn
  • Male
  • Medical Informatics / methods*
  • Middle Aged
  • Models, Statistical*
  • Young Adult