Toward Understanding the Size Dependence of Shape Features for Predicting Spiculation in Lung Nodules for Computer-Aided Diagnosis

J Digit Imaging. 2015 Dec;28(6):704-17. doi: 10.1007/s10278-015-9774-8.

Abstract

We analyze the importance of shape features for predicting spiculation ratings assigned by radiologists to lung nodules in computed tomography (CT) scans. Using the Lung Image Database Consortium (LIDC) data and classification models based on decision trees, we demonstrate that the importance of several shape features increases disproportionately relative to other image features with increasing size of the nodule. Our shaped-based classification results show an area under the receiver operating characteristic (ROC) curve of 0.65 when classifying spiculation for small nodules and an area of 0.91 for large nodules, resulting in a 26% difference in classification performance using shape features. An analysis of the results illustrates that this change in performance is driven by features that measure boundary complexity, which perform well for large nodules but perform relatively poorly and do no better than other features for small nodules. For large nodules, the roughness of the segmented boundary maps well to the semantic concept of spiculation. For small nodules, measuring directly the complexity of hard segmentations does not yield good results for predicting spiculation due to limits imposed by spatial resolution and the uncertainty in boundary location. Therefore, a wider range of features, including shape, texture, and intensity features, are needed to predict spiculation ratings for small nodules. A further implication is that the efficacy of shape features for a particular classifier used to create computer-aided diagnosis systems depends on the distribution of nodule sizes in the training and testing sets, which may not be consistent across different research studies.

Keywords: Biomedical image analysis; Computed tomography; Computer analysis; Computer-aided diagnosis (CAD); Decision support; Decision trees; Diagnostic imaging; Image interpretation; LIDC.

MeSH terms

  • Humans
  • Lung / diagnostic imaging
  • Lung Neoplasms / diagnostic imaging*
  • ROC Curve
  • Radiographic Image Interpretation, Computer-Assisted / methods*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Solitary Pulmonary Nodule / diagnostic imaging*
  • Tomography, X-Ray Computed / methods*