Siamese neural networks for the classification of high-dimensional radiomic features

Abhishaike Mahajan; James Dormer; Qinmei Li; Deji Chen; Zhenfeng Zhang; Baowei Fei

doi:10.1117/12.2549389

Siamese neural networks for the classification of high-dimensional radiomic features

Proc SPIE Int Soc Opt Eng. 2020 Feb:11314:113143Q. doi: 10.1117/12.2549389. Epub 2020 Mar 16.

Authors

Abhishaike Mahajan^{1

2}, James Dormer¹, Qinmei Li^{1

3}, Deji Chen³, Zhenfeng Zhang³, Baowei Fei^{1

4}

Affiliations

¹ Department of Bioengineering, University of Texas at Dallas, Richardson, TX.
² Department of Cognition and Neuroscience, University of Texas at Dallas, Richardson, TX.
³ Department of Radiology, The Second Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong, China.
⁴ Department of Radiology and Advanced Imaging Research Center, University of Texas Southwestern Medical Center, Dallas, TX.

Abstract

This study demonstrates that a variant of a Siamese neural network architecture is more effective at classifying high-dimensional radiomic features (extracted from T2 MRI images) than traditional models, such as a Support Vector Machine or Discriminant Analysis. Ninety-nine female patients, between the ages of 20 and 48, were imaged with T2 MRI. Using biopsy pathology, the patients were separated into two groups: those with breast cancer (N=55) and those with GLM (N=44). Lesions were segmented by a trained radiologist and the ROIs were used for radiomic feature extraction. The radiomic features include 536 published features from Aerts et al., along with 20 features recurrent quantification analysis features. A Student T-Test was used to select features found to be statistically significant between the two patient groups. These features were then used to train a Siamese neural network. The label given to test features was the label of whichever class the test features with the highest percentile similarity within the training group. Within the two highest-dimensional feature sets, the Siamese network produced an AUC of 0.853 and 0.894, respectively. This is compared to best non-Siamese model, Discriminant Analysis, which produced an AUC of 0.823 and 0.836 for the two respective feature sets. However, when it came to the lower-dimensional recurrent features and the top-20 most significant features from Aerts et al., the Siamese network performed on-par or worse than the competing models. The proposed Siamese neural network architecture can outperform competing other models in high-dimensional, low-sample size spaces with regards to tabular data.

Keywords: Breast Cancer; Disease Classification; MRI; Machine Learning; Mastitis; Neural Network; Radiomics; Siamese Network.

Abstract

Grants and funding