Machine learning for classification of postoperative patient status using standardized medical data

Comput Methods Programs Biomed. 2022 Feb:214:106583. doi: 10.1016/j.cmpb.2021.106583. Epub 2021 Dec 12.

Abstract

Background and objective: Real-world evidence is defined as clinical evidence regarding the use and potential benefits or risks of a medical product derived from real-world data analyses. Standardization and structuring of data are necessary to analyze medical real-world data collected from different medical institutions. An electronic message and repository have been developed to link electronic medical records in this research project, which has simplified the data integration. Therefore, this paper proposes an analysis method and learning health systems to determine the priority of clinical intervention by clustering and visualizing time-series and prioritizing patient outcomes and status during hospitalization.

Methods: Common data items for reimbursement (Diagnosis Procedure Combination [DPC]) and clinical pathway data were examined in this project at each participating institution that runs the verification test. Long-term hospitalization data were analyzed using the data stored in the cloud platform of the institutions' repositories using multiple machine learning methods for classification, visualization, and interpretation.

Results: The ePath platform contributed to integrate the standardized data from multiple institutions. The distribution of DPC items or variances could be confirmed by clustering, temporal tendency through the directed graph, and extracting variables that contributed to the prediction and evaluation of SHapley Additive Explanation effects. Constipation was determined to be the risk factor most strongly related to long-term hospitalization. Drainage management was identified as a factor that can improve long-term hospitalization. These analyses effectively extracted patient status to provide feedback to the learning health system.

Conclusions: We successfully generated evidence of medical processes by gathering patient status, medical purposes, and patient outcomes with high data quality from multiple institutions, which were difficult with conventional electronic medical records. Regarding the significant analysis results, the learning health system will be used on this project to provide feedback to each institution, operate it for a certain period, and analyze and re-evaluate it.

Keywords: Clinical pathway; Diagnosis procedure combination (DPC); Learning health system; Machine learning; Real-world data (RWD).

MeSH terms

  • Electronic Health Records*
  • Hospitalization
  • Humans
  • Machine Learning*
  • Postoperative Period
  • Risk Factors