Malignancy risk estimation of pulmonary nodules in screening CTs: Comparison between a computer model and human observers

Sarah J van Riel; Francesco Ciompi; Mathilde M Winkler Wille; Asger Dirksen; Stephen Lam; Ernst Th Scholten; Santiago E Rossi; Nicola Sverzellati; Matiullah Naqibullah; Rianne Wittenberg; Marieke C Hovinga-de Boer; Miranda Snoeren; Liesbeth Peters-Bax; Onno Mets; Monique Brink; Mathias Prokop; Cornelia Schaefer-Prokop; Bram van Ginneken

doi:10.1371/journal.pone.0185032

Malignancy risk estimation of pulmonary nodules in screening CTs: Comparison between a computer model and human observers

PLoS One. 2017 Nov 9;12(11):e0185032. doi: 10.1371/journal.pone.0185032. eCollection 2017.

Authors

Sarah J van Riel¹, Francesco Ciompi¹, Mathilde M Winkler Wille², Asger Dirksen³, Stephen Lam⁴, Ernst Th Scholten¹, Santiago E Rossi⁵, Nicola Sverzellati⁶, Matiullah Naqibullah³, Rianne Wittenberg⁷, Marieke C Hovinga-de Boer⁸, Miranda Snoeren¹, Liesbeth Peters-Bax¹, Onno Mets⁹, Monique Brink¹, Mathias Prokop¹, Cornelia Schaefer-Prokop^{1

8}, Bram van Ginneken¹

Affiliations

¹ Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Nijmegen, The Netherlands.
² Department of Diagnostic Imaging, Section of Radiology, Nordsjællands Hospital, Hillerød, Denmark.
³ Department of Pulmonology, Gentofte Hospital, University of Copenhagen, Hellerup, Denmark.
⁴ Department of Integrative Oncology, British Columbia Cancer Agency, Vancouver, Canada.
⁵ Department of Radiology, Centro de Diagnostico Dr Enrique Rossi, Buenos Aires, Argentina.
⁶ Department of Clinical Sciences, Division of Radiology, University Hospital of Parma, Parma, Italy.
⁷ Department of Radiology, Vrije Universiteit Medisch Centrum, Amsterdam, the Netherlands.
⁸ Department of Radiology, Meander Medical Center, Amersfoort, the Netherlands.
⁹ Department of Radiology, UMC Utrecht, Utrecht, the Netherlands.

Abstract

Purpose: To compare human observers to a mathematically derived computer model for differentiation between malignant and benign pulmonary nodules detected on baseline screening computed tomography (CT) scans.

Methods: A case-cohort study design was chosen. The study group consisted of 300 chest CT scans from the Danish Lung Cancer Screening Trial (DLCST). It included all scans with proven malignancies (n = 62) and two subsets of randomly selected baseline scans with benign nodules of all sizes (n = 120) and matched in size to the cancers, respectively (n = 118). Eleven observers and the computer model (PanCan) assigned a malignancy probability score to each nodule. Performances were expressed by area under the ROC curve (AUC). Performance differences were tested using the Dorfman, Berbaum and Metz method. Seven observers assessed morphological nodule characteristics using a predefined list. Differences in morphological features between malignant and size-matched benign nodules were analyzed using chi-square analysis with Bonferroni correction. A significant difference was defined at p < 0.004.

Results: Performances of the model and observers were equivalent (AUC 0.932 versus 0.910, p = 0.184) for risk-assessment of malignant and benign nodules of all sizes. However, human readers performed superior to the computer model for differentiating malignant nodules from size-matched benign nodules (AUC 0.819 versus 0.706, p < 0.001). Large variations between observers were seen for ROC areas and ranges of risk scores. Morphological findings indicative of malignancy referred to border characteristics (spiculation, p < 0.001) and perinodular architectural deformation (distortion of surrounding lung parenchyma architecture, p < 0.001; pleural retraction, p = 0.002).

Conclusions: Computer model and human observers perform equivalent for differentiating malignant from randomly selected benign nodules, confirming the high potential of computer models for nodule risk estimation in population based screening studies. However, computer models highly rely on size as discriminator. Incorporation of other morphological criteria used by human observers to superiorly discriminate size-matched malignant from benign nodules, will further improve computer performance.

Publication types

Comparative Study

MeSH terms

Aged
Female
Humans
Lung Neoplasms / diagnostic imaging*
Male
Mass Screening*
Middle Aged
Probability
Radiographic Image Interpretation, Computer-Assisted*
Risk Factors
Solitary Pulmonary Nodule / diagnostic imaging*
Tomography, X-Ray Computed*

Grants and funding

This project was supported by a research grant of Mevis Medical Solutions AG, Bremen, Germany. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.