Identifying EGFR mutations in lung adenocarcinoma by noninvasive imaging using radiomics features and random forest modeling

Eur Radiol. 2019 Sep;29(9):4742-4750. doi: 10.1007/s00330-019-06024-y. Epub 2019 Feb 18.

Abstract

Objectives: Identifying tyrosine kinase inhibitor (TKI)-sensitive mutations of the epidermal growth factor receptor (EGFR) gene is essential in the treatment of lung adenocarcinoma. To overcome the difficulty of EGFR gene testing in situations where surgical and biopsy samples are too risky to obtain, we evaluated a noninvasive imaging method using radiomics features and random forest models.

Methods: Five hundred three lung adenocarcinoma patients who received surgery-based treatment were included in this study. The diagnosis and EGFR gene test were based on the resected specimens. TKI-sensitive mutations were found in 60.8% of the patients. CT scans acquired before any invasive operation were collected and analyzed to extract quantitative radiomics features and to build random forest classifiers that distinguish EGFR mutants from wild types. Clinical features (sex and smoking history) were then added to the image-based model. The model was trained on a set of 345 patients and validated on an independent test group (n = 158) using the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity.
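As a brief illustration of the main evaluation metric named in the Methods, the AUC can be read as the probability that a randomly chosen mutant case receives a higher model score than a randomly chosen wild-type case, with ties counted as half. The sketch below computes it directly from that definition. The function name `auc_from_scores` and the toy labels and scores are hypothetical and are not data from the study; the study's own software is not described in the abstract.

```python
def auc_from_scores(labels, scores):
    """AUC via the pairwise (Mann-Whitney) formulation.

    labels: 1 for EGFR mutant, 0 for wild type (assumed encoding).
    scores: model outputs, higher meaning "more likely mutant".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # mutant ranked above wild type
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Toy example only; these are NOT patient data from the study.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
toy_auc = auc_from_scores(toy_labels, toy_scores)

# Cohort sizes as reported in the abstract: training plus test set.
n_train, n_test = 345, 158
n_total = n_train + n_test
```

In this toy case, 8 of the 9 mutant/wild-type pairs are ranked correctly, so `toy_auc` is 8/9. The pairwise loop is quadratic in cohort size, which is acceptable for a test group of 158 patients; a rank-based formula would be the usual choice for larger sets.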

Results: The random forest model with 94 radiomics features reached an AUC of 0.802. The AUC was further improved to 0.828 by adding sex and smoking history. The sensitivity and specificity were 60.6% and 85.1% at the best diagnostic decision point.
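The abstract does not state how the "best diagnostic decision point" was defined. One common convention, assumed here purely for illustration, is the threshold that maximizes Youden's index J = sensitivity + specificity - 1. The sketch below shows how sensitivity and specificity follow from a chosen threshold; the helper names `sens_spec` and `youden_threshold` and the toy scores are hypothetical and unrelated to the study data.

```python
def sens_spec(labels, scores, threshold):
    """Sensitivity and specificity when score >= threshold is called 'mutant'."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def youden_threshold(labels, scores):
    """Return (J, threshold, sensitivity, specificity) maximizing Youden's J.

    Assumption: every observed score is tried as a candidate threshold,
    and the lowest threshold wins if several share the same J.
    """
    best = None
    for t in sorted(set(scores)):
        sens, spec = sens_spec(labels, scores, t)
        j = sens + spec - 1.0
        if best is None or j > best[0]:
            best = (j, t, sens, spec)
    return best


# Toy example only; these are NOT patient data from the study.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.6, 0.5, 0.7, 0.4, 0.2, 0.1]
best_j, best_t, best_sens, best_spec = youden_threshold(toy_labels, toy_scores)
```

Other definitions of the optimal point exist (for example, the point closest to the top-left corner of the ROC curve), and they can yield a different sensitivity/specificity trade-off than the one reported.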

Conclusion: Our results show that radiomics not only reflects genetic differences among tumors but also has diagnostic value and potential as a diagnostic tool.

Key points

  • Radiomics provides a potential noninvasive method for predicting EGFR mutation status.
  • In situations where surgery and biopsy are not available, CT image-based radiomics models could help guide treatment decisions.
  • The accuracy, sensitivity, and specificity still need to be improved before the image-based EGFR identifier can be used in clinical practice.

Keywords: Epidermal growth factor receptor (EGFR); Non-small cell lung cancer (NSCLC); Radiomics; Random forest.

MeSH terms

  • Adenocarcinoma of Lung / diagnostic imaging*
  • Adenocarcinoma of Lung / genetics*
  • ErbB Receptors / genetics
  • Female
  • Humans
  • Lung Neoplasms / diagnostic imaging*
  • Lung Neoplasms / genetics*
  • Male
  • Middle Aged
  • Mutation*
  • ROC Curve
  • Retrospective Studies
  • Sensitivity and Specificity
  • Tomography, X-Ray Computed / methods*

Substances

  • EGFR protein, human
  • ErbB Receptors