Automated, High-Throughput Platform to Generate a High-Reliability, Comprehensive Rectal Cancer Database

JCO Clin Cancer Inform. 2024 May:8:e2300219. doi: 10.1200/CCI.23.00219.

Abstract

Purpose: Dynamic operations platforms allow for cross-platform data extraction, integration, and analysis, although application of these platforms to large-scale oncology enterprises has not been described. This study presents a pipeline for automated, high-fidelity extraction, integration, and validation of cross-platform oncology data in patients undergoing treatment for rectal cancer at a single, high-volume institution.

Methods: A dynamic operations platform was used to identify patients with rectal cancer treated at MD Anderson Cancer Center between 2016 and 2022 who had magnetic resonance imaging (MRI) imaging and preoperative treatment details available in the electronic health record (EHR). Demographic, clinicopathologic, tumor mutation, radiographic, and treatment data were extracted from the EHR using a methodology adaptable to any disease site. Data accuracy was assessed by manual review. Accuracy before and after implementation of synoptic reporting was determined for MRI data.

Results: A total of 516 patients with localized rectal cancer were included. In the era after institutional adoption of synoptic reports, the dynamic operations platform extracted T (tumor) category data from the EHR with 95% accuracy compared with 87% before the use of synoptic reports, and N (lymph node) category with 88% compared with 58%. Correct extraction of pelvic sidewall adenopathy was 94% compared with 78%, and extramural vascular invasion accuracy was 99% compared with 89%. Neoadjuvant chemotherapy and radiation data were 99% accurate for patients who had synoptic data sources.

Conclusion: Using dynamic operations platforms enables automated cross-platform integration of multiparameter oncology data with high fidelity in patients undergoing multimodality treatment for rectal cancer. These pipelines can be adapted to other solid tumors and, together with standardized reporting, can increase efficiency in clinical research and the translation of actionable findings toward optimizing patient outcomes.

MeSH terms

  • Adult
  • Aged
  • Databases, Factual*
  • Electronic Health Records
  • Female
  • Humans
  • Magnetic Resonance Imaging* / methods
  • Male
  • Middle Aged
  • Neoplasm Staging
  • Rectal Neoplasms* / diagnosis
  • Rectal Neoplasms* / diagnostic imaging
  • Rectal Neoplasms* / pathology
  • Rectal Neoplasms* / therapy
  • Reproducibility of Results