Exploring Histological Similarities Across Cancers From a Deep Learning Perspective

Ashish Menon; Piyush Singh; P K Vinod; C V Jawahar

doi:10.3389/fonc.2022.842759

Exploring Histological Similarities Across Cancers From a Deep Learning Perspective

Front Oncol. 2022 Mar 30:12:842759. doi: 10.3389/fonc.2022.842759. eCollection 2022.

Authors

Ashish Menon¹, Piyush Singh¹, P K Vinod², C V Jawahar¹

Affiliations

¹ Center for Visual Information Technology, International Institute of Information Technology (IIIT) Hyderabad, Hyderabad, India.
² Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology (IIIT) Hyderabad, Hyderabad, India.

Abstract

Histopathology image analysis is widely accepted as a gold standard for cancer diagnosis. The Cancer Genome Atlas (TCGA) contains large repositories of histopathology whole slide images spanning several organs and subtypes. However, not much work has gone into analyzing all the organs and subtypes and their similarities. Our work attempts to bridge this gap by training deep learning models to classify cancer vs. normal patches for 11 subtypes spanning seven organs (9,792 tissue slides) to achieve high classification performance. We used these models to investigate their performances in the test set of other organs (cross-organ inference). We found that every model had a good cross-organ inference accuracy when tested on breast, colorectal, and liver cancers. Further, high accuracy is observed between models trained on the cancer subtypes originating from the same organ (kidney and lung). We also validated these performances by showing the separability of cancer and normal samples in a high-dimensional feature space. We further hypothesized that the high cross-organ inferences are due to shared tumor morphologies among organs. We validated the hypothesis by showing the overlap in the Gradient-weighted Class Activation Mapping (GradCAM) visualizations and similarities in the distributions of nuclei features present within the high-attention regions.

Keywords: TCGA; cancer classification; class activation map (CAM); cross-organ inference; deep learning; histopathology; tissue morphology.