Differentiating Benign from Malignant Renal Tumors Using T2- and Diffusion-Weighted Images: A Comparison of Deep Learning and Radiomics Models Versus Assessment from Radiologists

J Magn Reson Imaging. 2022 Apr;55(4):1251-1259. doi: 10.1002/jmri.27900. Epub 2021 Aug 30.

Abstract

Background: Differentiating benign from malignant renal tumors is important for selection of the most effective treatment.

Purpose: To develop magnetic resonance imaging (MRI)-based deep learning (DL) models for differentiation of benign and malignant renal tumors and to compare their discrimination performance with the performance of radiomics models and assessment by radiologists.

Study type: Retrospective.

Population: A total of 217 patients were randomly assigned to a training cohort (N = 173) or a testing cohort (N = 44).

Field strength/sequence: Diffusion-weighted imaging (DWI) and fast spin-echo sequence T2-weighted imaging (T2WI) at 3.0T.

Assessment: A radiologist manually labeled the region of interest (ROI) on each image. Three DL models using ResNet-18 architecture and three radiomics models using random forest were developed using T2WI alone, DWI alone, and a combination of the two image sets to discriminate between benign and malignant renal tumors. The diagnostic performance of two radiologists was assessed based on professional experience. We also compared the performance of each model and the radiologists.

Statistical tests: The area under the receiver operating characteristic (ROC) curve (AUC) was used to assess the performance of each model and the radiologists. P < 0.05 indicated statistical significance.

Results: The AUC of the DL models based on T2WI, DWI, and the combination was 0.906, 0.846, and 0.925 in the testing cohorts, respectively. The AUC of the combination DL model was significantly better than that of the models based on individual sequences (0.925 > 0.906, 0.925 > 0.846). The AUC of the radiomics models based on T2WI, DWI, and the combination was 0.824, 0.742, and 0.826 in the testing cohorts, respectively. The AUC of two radiologists was 0.724 and 0.667 in the testing cohorts.

Conclusion: Thus, the MRI-based DL model is useful for differentiating benign from malignant renal tumors in clinic, and the DL model based on T2WI + DWI had the best performance.

Level of evidence: 3 TECHNICAL EFFICACY STAGE: 2.

Keywords: MRI; deep learning; radiomics; renal tumor.

Publication types

  • Randomized Controlled Trial

MeSH terms

  • Deep Learning*
  • Diffusion Magnetic Resonance Imaging
  • Female
  • Humans
  • Kidney Neoplasms* / diagnostic imaging
  • Magnetic Resonance Imaging
  • Male
  • Neuroblastoma*
  • Radiologists
  • Retrospective Studies