Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr 14;11(1):1778.
doi: 10.1038/s41467-020-15671-5.

Computational analysis of pathological images enables a better diagnosis of TFE3 Xp11.2 translocation renal cell carcinoma

Affiliations
Free PMC article

Computational analysis of pathological images enables a better diagnosis of TFE3 Xp11.2 translocation renal cell carcinoma

Jun Cheng et al. Nat Commun. .
Free PMC article

Abstract

TFE3 Xp11.2 translocation renal cell carcinoma (TFE3-RCC) generally progresses more aggressively compared with other RCC subtypes, but it is challenging to diagnose TFE3-RCC by traditional visual inspection of pathological images. In this study, we collect hematoxylin and eosin- stained histopathology whole-slide images of 74 TFE3-RCC cases (the largest cohort to date) and 74 clear cell RCC cases (ccRCC, the most common RCC subtype) with matched gender and tumor grade. An automatic computational pipeline is implemented to extract image features. Comparative study identifies 52 image features with significant differences between TFE3-RCC and ccRCC. Machine learning models are built to distinguish TFE3-RCC from ccRCC. Tests of the classification models on an external validation set reveal high accuracy with areas under ROC curve ranging from 0.842 to 0.894. Our results suggest that automatically derived image features can capture subtle morphological differences between TFE3-RCC and ccRCC and contribute to a potential guideline for TFE3-RCC diagnosis.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Workflow scheme.
a H&E-stained tissue slides were digitized by a scanner to obtain whole-slide images. b A large set of quantitative image features were extracted, characterizing nucleus size, staining, shape, and density. c The Mann–Whitney U test was used to compare image features between TFE3-RCC and ccRCC, and machine learning models were developed based on the image features to automatically classify the two cancer subtypes. On the box plots in c, the central mark indicates the median, and the bottom and top edges of the box indicate the 25th and 75th percentiles (q1 and q3), respectively. The upper whisker extends from q3 to q3 + 1.5 × (q3 − q1), and the lower whisker extends from q1 to q1 − 1.5 × (q3 − q1), while data beyond the end of the whiskers are outlying points that are plotted individually.
Fig. 2
Fig. 2. Feature extraction pipeline.
a The nuclei in whole-slide images are automatically segmented. b For each segmented nucleus, 10 nucleus-level features, regarding nucleus size, staining intensity, shape, and density, are extracted. c For each type of nucleus-level features from the same whole-slide image, they are dissected into 15 image-level features using a 10-bin histogram and five distribution statistics. Scale bars: 5 mm (a whole-slide image) and 50 µm (a image patch).
Fig. 3
Fig. 3. Comparison of image features between TFE3-RCC and ccRCC.
For each feature, the fold change is defined as the ratio of the median feature values between ccRCC and TFE3-RCC. 52 image features that show significant differences between TFE3-RCC and ccRCC are identified using the two-sided Mann–Whitney U test. Multiple comparison correction is performed using false discovery rate procedure at 5% level.
Fig. 4
Fig. 4. The performance of the four machine learning models.
a Classification performance on dataset 1 using five-fold cross-validation (n = 5 experiments for each model). For each, 80% of patients were used as the training set and the remaining patients were used as the internal validation set. b Receiver operating characteristics curves for classifying TFE3-RCC and ccRCC in the external validation set (dataset 2). Models were trained using dataset 1 and evaluated using dataset 2. The 95% confidence intervals for the AUC: LR (0.763–0.984), RF (0.736–0.960), SVM-L (0.725–0.959), and SVM-G (0.797–0.991). LR, logistic regression; RF, random forest; SVM-L, SVM with linear kernel; SVM-G, SVM with Gaussian kernel. Data are represented as mean ± SD in a.

Similar articles

Cited by

References

    1. Cheng, L., et al. Urologic Surgical Pathology, 4th ed. (Elsevier, 2019).
    1. MacLennan GT, Cheng L. Five decades of urologic pathology: the accelerating expansion of knowledge in renal cell neoplasia. Hum. Pathol. 2020;95:24–45. doi: 10.1016/j.humpath.2019.09.009. - DOI - PubMed
    1. Moch H, et al. The 2016 WHO classification of tumours of the urinary system and male genital organs-part A: renal, penile, and testicular tumours. Eur. Urol. 2016;70:93–105. doi: 10.1016/j.eururo.2016.02.029. - DOI - PubMed
    1. Ricketts CJ, et al. The Cancer Genome Atlas comprehensive molecular characterization of renal cell carcinoma. Cell Rep. 2018;23:313–326.e315. doi: 10.1016/j.celrep.2018.03.075. - DOI - PMC - PubMed
    1. Argani P. MiT family translocation renal cell carcinoma. Semin. Diagn. Pathol. 2015;32:103–113. doi: 10.1053/j.semdp.2015.02.003. - DOI - PubMed

Publication types

MeSH terms

Substances