Towards large-scale case-finding: training and validation of residual networks for detection of chronic obstructive pulmonary disease using low-dose CT

Lancet Digit Health. 2020 May;2(5):e259-e267. doi: 10.1016/S2589-7500(20)30064-9. Epub 2020 Apr 21.

Abstract

Background: Chronic obstructive pulmonary disease (COPD) is underdiagnosed in the community. Thoracic CT scans are widely used for diagnostic and screening purposes for lung cancer. In this proof-of-concept study, we aimed to evaluate a software pipeline for the automated detection of COPD, based on deep learning and a dataset of low-dose CTs that were performed for early detection of lung cancer.

Methods: We examined the use of deep residual networks, a type of artificial residual network, for the automated detection of COPD. Three versions of the residual networks were independently trained to perform COPD diagnosis using random subsets of CT scans collected from the PanCan study, which enrolled ex-smokers and current smokers at high risk of lung cancer, and evaluated the networks using three-fold cross-validation experiments. External validation was performed using 2153 CT scans acquired from a separate cohort of individuals with COPD in the ECLIPSE study. Spirometric data were used to define COPD, with stages defined according to the GOLD criteria.

Findings: The best performing networks achieved an area under the receiver operating characteristic curve (AUC) of 0·889 (SD 0·017) in three-fold cross-validation experiments. When the same set of networks was applied to the ECLIPSE cohort without any modifications to the trained models, they achieved an AUC of 0·886 (0·017), a positive predictive value of 0·847 (0·056), and a negative predictive value of 0·755 (0·097), which is a greater performance than the best quantitative CT measure, the percentage of lung volumes of less than or equal to -950 Hounsfield units (AUC 0·742).

Interpretation: Our proposed approach could identify patients with COPD among ex-smokers and current smokers without a previous diagnosis of COPD, with clinically acceptable performance. The use of deep residual networks on chest CT scans could be an effective case-finding tool for COPD detection and diagnosis, particularly in ex-smokers and current smokers who are being screened for lung cancer.

Funding: Data Science Institute, University of British Columbia; Canadian Institutes of Health Research.

Trial registration: ClinicalTrials.gov NCT00292552.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Aged
  • Area Under Curve
  • Artificial Intelligence*
  • Canada
  • Cohort Studies
  • Data Analysis
  • Disease Progression
  • Ex-Smokers
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Lung / diagnostic imaging
  • Lung / pathology*
  • Lung Neoplasms / diagnosis
  • Lung Neoplasms / diagnostic imaging
  • Lung Neoplasms / etiology
  • Lung Neoplasms / pathology
  • Male
  • Mass Screening / methods*
  • Middle Aged
  • Models, Biological*
  • Neural Networks, Computer
  • Pulmonary Disease, Chronic Obstructive / diagnosis*
  • Pulmonary Disease, Chronic Obstructive / diagnostic imaging
  • ROC Curve
  • Risk Assessment
  • Smokers
  • Smoking / adverse effects*
  • Tomography, X-Ray Computed / methods

Associated data

  • ClinicalTrials.gov/NCT00292552

Grant support