Multimodal fusion with deep neural networks for leveraging CT imaging and electronic health record: a case-study in pulmonary embolism detection

Sci Rep. 2020 Dec 17;10(1):22147. doi: 10.1038/s41598-020-78888-w.


Recent advancements in deep learning have led to a resurgence of medical imaging and Electronic Medical Record (EMR) models for a variety of applications, including clinical decision support, automated workflow triage, clinical prediction and more. However, very few models have been developed to integrate both clinical and imaging data, even though in routine practice clinicians rely on the EMR to provide context when interpreting medical images. In this study, we developed and compared different multimodal fusion model architectures capable of using both pixel data from volumetric Computed Tomography Pulmonary Angiography scans and clinical patient data from the EMR to automatically classify Pulmonary Embolism (PE) cases. The best-performing multimodal model is a late fusion model that achieves an AUROC of 0.947 [95% CI: 0.946-0.948] on the entire held-out test set, outperforming imaging-only and EMR-only single-modality models.
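
For readers unfamiliar with the term, the general idea of late fusion can be sketched as follows: each modality gets its own model, and only the per-patient output probabilities are combined. This is a minimal illustration and not the authors' implementation. The function names (`late_fusion`, `auroc`), the weighted-average rule, and the toy numbers are all assumptions made for this example; the abstract does not state how the single-modality outputs were aggregated.

```python
from typing import List, Sequence


def late_fusion(p_imaging: Sequence[float],
                p_emr: Sequence[float],
                w_imaging: float = 0.5) -> List[float]:
    """Combine per-patient PE probabilities from two separately trained models.

    Assumption: a weighted average is used as the fusion rule. Other rules
    (e.g. a small classifier trained on both outputs) are equally plausible.
    """
    if len(p_imaging) != len(p_emr):
        raise ValueError("both models must score the same patients")
    if not 0.0 <= w_imaging <= 1.0:
        raise ValueError("w_imaging must lie in [0, 1]")
    return [w_imaging * a + (1.0 - w_imaging) * b
            for a, b in zip(p_imaging, p_emr)]


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair. Quadratic in the number of patients,
    which is fine for a small illustration.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Made-up toy values for four hypothetical patients (1 = PE present).
    labels = [1, 0, 1, 0]
    p_imaging = [0.9, 0.6, 0.4, 0.2]   # hypothetical imaging-only model output
    p_emr = [0.6, 0.3, 0.8, 0.5]       # hypothetical EMR-only model output

    fused = late_fusion(p_imaging, p_emr)
    print("imaging-only AUROC:", auroc(labels, p_imaging))
    print("EMR-only AUROC:   ", auroc(labels, p_emr))
    print("late-fusion AUROC:", auroc(labels, fused))
```

In practice a library routine such as scikit-learn's `roc_auc_score` would replace the hand-written `auroc`, and a confidence interval like the one reported above would typically be estimated by resampling the test set; neither detail is specified in the abstract.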

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Clinical Decision-Making
  • Disease Management
  • Electronic Health Records*
  • Humans
  • Image Interpretation, Computer-Assisted
  • Image Processing, Computer-Assisted / methods*
  • Machine Learning
  • Neural Networks, Computer*
  • Pulmonary Embolism / diagnosis*
  • Sensitivity and Specificity
  • Tomography, X-Ray Computed* / methods
  • Workflow