Identification and transfer of spatial transcriptomics signatures for cancer diagnosis

Breast Cancer Res. 2020 Jan 13;22(1):6. doi: 10.1186/s13058-019-1242-9.

Abstract

Background: Distinguishing ductal carcinoma in situ (DCIS) from invasive ductal carcinoma (IDC) regions in clinical biopsies constitutes a diagnostic challenge. Spatial transcriptomics (ST) is an in situ capturing method, which allows quantification and visualization of transcriptomes in individual tissue sections. In the past, studies have shown that breast cancer samples can be used to study their transcriptomes with spatial resolution in individual tissue sections. Previously, supervised machine learning methods were used in clinical studies to predict the clinical outcomes for cancer types.

Methods: We used four publicly available ST breast cancer datasets from breast tissue sections annotated by pathologists as non-malignant, DCIS, or IDC. We trained and tested a machine learning method (support vector machine) based on the expert annotation as well as based on automatic selection of cell types by their transcriptome profiles.

Results: We identified expression signatures for expert annotated regions (non-malignant, DCIS, and IDC) and build machine learning models. Classification results for 798 expression signature transcripts showed high coincidence with the expert pathologist annotation for DCIS (100%) and IDC (96%). Extending our analysis to include all 25,179 expressed transcripts resulted in an accuracy of 99% for DCIS and 98% for IDC. Further, classification based on an automatically identified expression signature covering all ST spots of tissue sections resulted in prediction accuracy of 95% for DCIS and 91% for IDC.

Conclusions: This concept study suggest that the ST signatures learned from expert selected breast cancer tissue sections can be used to identify breast cancer regions in whole tissue sections including regions not trained on. Furthermore, the identified expression signatures can classify cancer regions in tissue sections not used for training with high accuracy. Expert-generated but even automatically generated cancer signatures from ST data might be able to classify breast cancer regions and provide clinical decision support for pathologists in the future.

Keywords: Breast cancer; Cancer diagnosis; Expression signature; Machine learning; Spatial transcriptomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / genetics*
  • Breast Neoplasms / classification
  • Breast Neoplasms / diagnosis*
  • Breast Neoplasms / genetics
  • Carcinoma, Ductal, Breast / diagnosis*
  • Carcinoma, Ductal, Breast / genetics
  • Carcinoma, Intraductal, Noninfiltrating / diagnosis*
  • Carcinoma, Intraductal, Noninfiltrating / genetics
  • Female
  • Humans
  • Machine Learning*
  • Molecular Typing / methods*
  • ROC Curve
  • Spatial Analysis
  • Transcriptome*

Substances

  • Biomarkers, Tumor