The robustness of radiomic texture analysis across different manufacturers of mammography imaging systems is investigated. We quantified feature robustness across mammography manufacturers using a dataset of 111 women who underwent consecutive screening mammography on both general electric and Hologic systems. In each mammogram, a square region of interest (ROI) directly behind the nipple was manually selected. Radiomic features describing parenchymal patterns were automatically extracted on each ROI. Feature comparisons were conducted between manufacturers (and breast densities) using newly developed robustness metrics descriptive of correlation, equivalence, and variability. By examining the distribution of these metric values, we propose the following selection criteria to guide feature evaluation in this dataset: (1) [Formula: see text] of feature ratios [Formula: see text], (2) standard deviation of feature ratios [Formula: see text], (3) correlation of features [Formula: see text], and (4) [Formula: see text]. Statistically significant correlation coefficients ranged from 0.13 to 0.68 in comparisons between the two mammographic systems tested. Features describing spatial patterns tended to exhibit high correlation coefficients, while intensity- and directionality-based features had comparatively poor correlation. Our proposed robustness metrics may be used to evaluate other datasets, for which different ranges of metric values may be appropriate.
Keywords: breast cancer; mammography; quantitative imaging; radiomics; robustness.