Radiomics-based machine learning methods for isocitrate dehydrogenase genotype prediction of diffuse gliomas

Shuang Wu; Jin Meng; Qi Yu; Ping Li; Shen Fu

doi:10.1007/s00432-018-2787-1

Radiomics-based machine learning methods for isocitrate dehydrogenase genotype prediction of diffuse gliomas

J Cancer Res Clin Oncol. 2019 Mar;145(3):543-550. doi: 10.1007/s00432-018-2787-1. Epub 2019 Feb 4.

Authors

Shuang Wu^{1

2}, Jin Meng^{1

2}, Qi Yu^{1

2}, Ping Li³, Shen Fu^{4

5

6

7

8}

Affiliations

¹ Department of Radiation Oncology, Fudan University Shanghai Cancer Center, 270 Dong'An Road, Xuhui District, Shanghai, 200032, China.
² Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
³ Department of Radiation Oncology, Shanghai Proton and Heavy Ion Center, Shanghai, 201321, China.
⁴ Department of Radiation Oncology, Fudan University Shanghai Cancer Center, 270 Dong'An Road, Xuhui District, Shanghai, 200032, China. shen_fu@hotmail.com.
⁵ Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, 200032, China. shen_fu@hotmail.com.
⁶ Department of Radiation Oncology, Shanghai Proton and Heavy Ion Center, Shanghai, 201321, China. shen_fu@hotmail.com.
⁷ Key Laboratory of Nuclear Physics and Ion-beam Application (MOE), Fudan University, Shanghai, 200433, China. shen_fu@hotmail.com.
⁸ Department of Radiation Oncology, Shanghai Concord Cancer Hospital, Shanghai, 200020, China. shen_fu@hotmail.com.

Abstract

Purpose: Reliable and accurate predictive models are necessary to drive the success of radiomics. Our aim was to identify the optimal radiomics-based machine learning method for isocitrate dehydrogenase (IDH) genotype prediction in diffuse gliomas.

Methods: Eight classical machine learning methods were evaluated in terms of their stability and performance for pre-operative IDH genotype prediction. A total of 126 patients were enrolled for analysis. Overall, 704 radiomic features extracted from the pre-operative MRI images were analyzed. The patients were randomly assigned to either the training set or the validation set at a ratio of 2:1. Feature selection and classification model training were done using the training set, whereas the predictive performance and stability of the model were independently assessed using the validation set.

Results: Random Forest (RF) showed high predictive performance (accuracy 0.885 ± 0.041, AUC 0.931 ± 0.036), whereas neural network (NN) (accuracy 0.829 ± 0.064, AUC 0.878 ± 0.052) and flexible discriminant analysis (FDA) (accuracy 0.851 ± 0.049, AUC 0.875 ± 0.057) displayed low predictive performance. With regard to stability, RF also showed high robustness against data perturbation (relative standard deviations, RSD 3.87%).

Conclusions: RF is a promising machine learning method in predicting IDH genotype. Development of an accurate and reliable model can assist in the initial diagnostic evaluation and treatment planning for diffuse glioma patients.

Keywords: Diffuse glioma; Isocitrate dehydrogenase; Machine learning; Magnetic resonance imaging; Radiomics.

MeSH terms

Adolescent
Adult
Aged
Aged, 80 and over
Brain Neoplasms / diagnostic imaging
Brain Neoplasms / genetics*
Female
Genotype
Glioma / diagnostic imaging
Glioma / genetics*
Humans
Image Interpretation, Computer-Assisted / methods*
Isocitrate Dehydrogenase / genetics*
Machine Learning*
Magnetic Resonance Imaging
Male
Middle Aged
Young Adult

Substances

Isocitrate Dehydrogenase

Abstract

MeSH terms

Substances

Grants and funding