Scalable and accurate deep learning with electronic health records
- PMID: 31304302
- PMCID: PMC6550175
- DOI: 10.1038/s41746-018-0029-1
Scalable and accurate deep learning with electronic health records
Abstract
Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that discards the vast majority of information in each patient's record. We propose a representation of patients' entire raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. We validated our approach using de-identified EHR data from two US academic medical centers with 216,221 adult patients hospitalized for at least 24 h. In the sequential format we propose, this volume of EHR data unrolled into a total of 46,864,534,945 data points, including clinical notes. Deep learning models achieved high accuracy for tasks such as predicting: in-hospital mortality (area under the receiver operator curve [AUROC] across sites 0.93-0.94), 30-day unplanned readmission (AUROC 0.75-0.76), prolonged length of stay (AUROC 0.85-0.86), and all of a patient's final discharge diagnoses (frequency-weighted AUROC 0.90). These models outperformed traditional, clinically-used predictive models in all cases. We believe that this approach can be used to create accurate and scalable predictions for a variety of clinical scenarios. In a case study of a particular prediction, we demonstrate that neural networks can be used to identify relevant information from the patient's chart.
Keywords: Machine learning; Medical research.
Conflict of interest statement
Competing interestsThe authors declare no competing interests.
Figures
Similar articles
-
Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.J Biomed Inform. 2019 Nov;99:103310. doi: 10.1016/j.jbi.2019.103310. Epub 2019 Oct 14. J Biomed Inform. 2019. PMID: 31622801 Free PMC article.
-
Predicting next-day discharge via electronic health record access logs.J Am Med Inform Assoc. 2021 Nov 25;28(12):2670-2680. doi: 10.1093/jamia/ocab211. J Am Med Inform Assoc. 2021. PMID: 34592753 Free PMC article.
-
Early Detection of Septic Shock Onset Using Interpretable Machine Learners.J Clin Med. 2021 Jan 15;10(2):301. doi: 10.3390/jcm10020301. J Clin Med. 2021. PMID: 33467539 Free PMC article.
-
Deep learning prediction models based on EHR trajectories: A systematic review.J Biomed Inform. 2023 Aug;144:104430. doi: 10.1016/j.jbi.2023.104430. Epub 2023 Jun 26. J Biomed Inform. 2023. PMID: 37380061 Review.
-
Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31. J Biomed Inform. 2021. PMID: 33387683 Review.
Cited by
-
Cognitive Computing-Based CDSS in Medical Practice.Health Data Sci. 2021 Jul 22;2021:9819851. doi: 10.34133/2021/9819851. eCollection 2021. Health Data Sci. 2021. PMID: 38487503 Free PMC article. Review.
-
Development and validation of 'Patient Optimizer' (POP) algorithms for predicting surgical risk with machine learning.BMC Med Inform Decis Mak. 2024 Mar 11;24(1):70. doi: 10.1186/s12911-024-02463-w. BMC Med Inform Decis Mak. 2024. PMID: 38468330 Free PMC article.
-
Multimodal risk prediction with physiological signals, medical images and clinical notes.Heliyon. 2024 Feb 28;10(5):e26772. doi: 10.1016/j.heliyon.2024.e26772. eCollection 2024 Mar 15. Heliyon. 2024. PMID: 38455585 Free PMC article.
-
Exploring the application and future outlook of Artificial intelligence in pancreatic cancer.Front Oncol. 2024 Feb 21;14:1345810. doi: 10.3389/fonc.2024.1345810. eCollection 2024. Front Oncol. 2024. PMID: 38450187 Free PMC article. Review.
-
Real-time prognostic biomarkers for predicting in-hospital mortality and cardiac complications in COVID-19 patients.PLOS Glob Public Health. 2024 Mar 6;4(3):e0002836. doi: 10.1371/journal.pgph.0002836. eCollection 2024. PLOS Glob Public Health. 2024. PMID: 38446834 Free PMC article.
References
-
- The Digital Universe: Driving Data Growth in Healthcare. Available at: https://www.emc.com/analyst-report/digital-universe-healthcare-vertical-... (Accessed 23 Feb 2017).
LinkOut - more resources
Full Text Sources
Other Literature Sources
