Influence of segmentation margin on machine learning-based high-dimensional quantitative CT texture analysis: a reproducibility study on renal clear cell carcinomas

Burak Kocak; Ece Ates; Emine Sebnem Durmaz; Melis Baykara Ulusan; Ozgur Kilickesmez

doi:10.1007/s00330-019-6003-8

Influence of segmentation margin on machine learning-based high-dimensional quantitative CT texture analysis: a reproducibility study on renal clear cell carcinomas

Eur Radiol. 2019 Sep;29(9):4765-4775. doi: 10.1007/s00330-019-6003-8. Epub 2019 Feb 12.

Authors

Burak Kocak¹, Ece Ates², Emine Sebnem Durmaz³, Melis Baykara Ulusan², Ozgur Kilickesmez²

Affiliations

¹ Department of Radiology, Istanbul Training and Research Hospital, Istanbul, Turkey. drburakkocak@gmail.com.
² Department of Radiology, Istanbul Training and Research Hospital, Istanbul, Turkey.
³ Department of Radiology, Cerrahpasa Medical Faculty, Istanbul University-Cerrahpasa, Istanbul, Turkey.

PMID: 30747300
DOI: 10.1007/s00330-019-6003-8

Abstract

Objective: To determine the possible influence of segmentation margin on each step (feature reproducibility, selection, and classification) of the machine learning (ML)-based high-dimensional quantitative computed tomography (CT) texture analysis (qCT-TA) of renal clear cell carcinomas (RcCCs).

Materials and methods: For this retrospective study, 47 patients with RcCC were included from a public database. Two segmentations were obtained by two radiologists for each tumour: (i) contour-focused and (ii) margin shrinkage of 2 mm. Texture features were extracted from original, filtered, and transformed CT images. Feature selection was done using a correlation-based algorithm. The ML classifier was k-nearest neighbours. Classifications were performed with and without using synthetic minority over-sampling technique. Reference standard was nuclear grade (low versus high). Intraclass correlation coefficient (ICC), Pearson's correlation coefficient, Wilcoxon signed-ranks test, and McNemar's test were used in the analysis.

Results: The segmentation with margin shrinkage of 2 mm (772 of 828; 93.2%) yielded more texture features with excellent reproducibility (ICC ≥ 0.9) than contour-focused segmentation (714 of 828; 86.2%), p < 0.0001. The feature selection algorithms resulted in different feature subsets for two segmentation datasets with only one common feature. All ML-based models based on contour-focused segmentation (area under the curve [AUC] range, 0.865-0.984) performed better than those with margin shrinkage of 2 mm (AUC range, 0.745-0.887), p < 0.05.

Conclusions: Each step of the ML-based high-dimensional qCT-TA was susceptible to a slight change of 2 mm in segmentation margin. Despite yielding fewer features with excellent reproducibility, use of the contour-focused segmentation provided better classification performance for distinguishing nuclear grade.

Key points: • Each step of a machine learning (ML)-based high-dimensional quantitative computed tomography texture analysis (qCT-TA) is sensitive to even a slight change of 2 mm in segmentation margin. • Despite yielding fewer texture features with excellent reproducibility, performing the segmentation focusing on the outermost boundary of the tumours provides better classification performance in ML-based qCT-TA of renal clear cell carcinomas for distinguishing nuclear grade. • Findings of an ML-based high-dimensional qCT-TA may not be reproducible in clinical practice even using the same feature selection algorithm and ML classifier unless the possible influence of the segmentation margin is considered.

Keywords: Artificial intelligence; Clear cell renal cell carcinoma; Machine learning; Multidetector computed tomography; Radiomics.

MeSH terms

Algorithms
Carcinoma, Renal Cell / diagnostic imaging*
Carcinoma, Renal Cell / pathology
Diagnosis, Differential
Female
Humans
Kidney Neoplasms / diagnostic imaging*
Kidney Neoplasms / pathology
Machine Learning*
Male
Middle Aged
Reproducibility of Results
Retrospective Studies
Tomography, X-Ray Computed / methods*