Integrating spatial gene expression and breast tumour morphology via deep learning

Nat Biomed Eng. 2020 Aug;4(8):827-834. doi: 10.1038/s41551-020-0578-x. Epub 2020 Jun 22.

Abstract

Spatial transcriptomics allows for the measurement of RNA abundance at a high spatial resolution, making it possible to systematically link the morphology of cellular neighbourhoods and spatially localized gene expression. Here, we report the development of a deep learning algorithm for the prediction of local gene expression from haematoxylin-and-eosin-stained histopathology images using a new dataset of 30,612 spatially resolved gene expression data matched to histopathology images from 23 patients with breast cancer. We identified over 100 genes, including known breast cancer biomarkers of intratumoral heterogeneity and the co-localization of tumour growth and immune activation, the expression of which can be predicted from the histopathology images at a resolution of 100 µm. We also show that the algorithm generalizes well to The Cancer Genome Atlas and to other breast cancer gene expression datasets without the need for re-training. Predicting the spatially resolved transcriptome of a tissue directly from tissue images may enable image-based screening for molecular biomarkers with spatial variation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism
  • Breast Neoplasms / genetics*
  • Breast Neoplasms / metabolism
  • Breast Neoplasms / pathology*
  • Deep Learning*
  • Female
  • Gene Expression Profiling / methods
  • Humans
  • Image Processing, Computer-Assisted
  • Reproducibility of Results
  • Transcriptome

Substances

  • Biomarkers, Tumor