Gene Expression Classification of Lung Adenocarcinoma into Molecular Subtypes

IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1187-1197. doi: 10.1109/TCBB.2019.2905553. Epub 2019 Mar 18.

Abstract

As one of the most common malignancies in the world, lung adenocarcinoma (LUAD) is currently difficult to cure. However, the advent of precision medicine provides an opportunity to improve the treatment of lung cancer. Subtyping lung cancer plays an important role in performing a specific treatment. Here, we developed a framework that combines k-means clustering, t-test, sensitivity analysis, self-organizing map (SOM) neural network, and hierarchical clustering methods to classify LUAD into four subtypes. We determined that 24 differentially expressed genes could be used as therapeutic targets, and five genes (i.e., RTKN2, ADAM6, SPINK1, COL3A1, and COL1A2) could be potential novel markers for LUAD. Multivariate analysis showed that the four subtypes could serve as prognostic subtypes. Representative genes of each subtype were also identified, which could be potentially targetable markers for the different subtypes. The function and pathway enrichment analyses of these representative genes showed that the four subtypes have different pathological mechanisms. Mutations associated with the subtypes, e.g., epidermal growth factor receptor (EGFR) mutations in subtype 4 and tumor protein p53 (TP53) mutations in subtypes 1 and 2, could serve as potential markers for drug development. The four subtypes provide a foundation for subtype-specific therapy of LUAD.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma of Lung* / classification
  • Adenocarcinoma of Lung* / genetics
  • Adenocarcinoma of Lung* / metabolism
  • Algorithms
  • Biomarkers, Tumor / analysis
  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism
  • Cluster Analysis
  • Gene Expression Profiling / methods*
  • Humans
  • Lung Neoplasms* / classification
  • Lung Neoplasms* / genetics
  • Lung Neoplasms* / metabolism
  • Mutation / genetics
  • Neural Networks, Computer*
  • Transcriptome / genetics

Substances

  • Biomarkers, Tumor