CT-based radiomics and machine learning to predict spread through air space in lung adenocarcinoma

Eur Radiol. 2020 Jul;30(7):4050-4057. doi: 10.1007/s00330-020-06694-z. Epub 2020 Feb 28.

Abstract

Purpose: Spread through air space (STAS) is a novel invasive pattern of lung adenocarcinoma and is also a risk factor for recurrence and worse prognosis of lung adenocarcinoma. The aims of this study are to develop and validate a computed tomography (CT)‑based radiomics model for preoperative prediction of STAS in lung adenocarcinoma.

Methods and materials: This retrospective study was approved by an institutional review board and included 462 (mean age, 58.06 years) patients with pathologically confirmed lung adenocarcinoma. STAS was identified in 90 patients (19.5%). Two experienced radiologists segmented and extracted radiomics features on preoperative thin-slice CT images with radiomics extension independently. Intraclass correlation coefficients (ICC) and Pearson's correlation were used to rule out those low reliable (ICC < 0.75) and redundant (r > 0.9) features. Univariate logistic regression was applied to select radiomics features which were associated with STAS. A radiomics-based machine learning predictive model using a random forest (RF) was developed and calibrated with fivefold cross-validation. The diagnostic performance of the model was measured by the area under the curve (AUC) of receiver operating characteristic (ROC).

Results: With univariate analysis, 12 radiomics features and age were found to be associated with STAS significantly. The RF model achieved an AUC of 0.754 (a sensitivity of 0.880 and a specificity of 0.588) for predicting STAS.

Conclusion: CT-based radiomics model can preoperatively predict STAS in lung adenocarcinoma with good diagnosis performance.

Key points: • CT-based radiomics and machine learning model can predict spread through air space (STAS) in lung adenocarcinoma with high accuracy. • The random forest (RF) model achieved an AUC of 0.754 (a sensitivity of 0.880 and a specificity of 0.588) for predicting STAS.

Keywords: Adenocarcinoma; Lung; Machine learning; Metastasis; Radiomics.

Publication types

  • Validation Study

MeSH terms

  • Adenocarcinoma of Lung / diagnostic imaging*
  • Adenocarcinoma of Lung / pathology*
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Logistic Models
  • Lung Neoplasms / diagnostic imaging*
  • Lung Neoplasms / pathology*
  • Machine Learning*
  • Male
  • Middle Aged
  • Neoplasm Invasiveness
  • Neoplasm Recurrence, Local
  • ROC Curve
  • Retrospective Studies
  • Risk Factors
  • Sensitivity and Specificity
  • Tomography, X-Ray Computed / methods*