Histopathological image and gene expression pattern analysis for predicting molecular features and prognosis of head and neck squamous cell carcinoma

Cancer Med. 2021 Jul;10(13):4615-4628. doi: 10.1002/cam4.3965. Epub 2021 May 13.

Abstract

Background: Histopathological image features offer a quantitative measurement of cellular morphology, and probably help for better diagnosis and prognosis in head and neck squamous cell carcinoma (HNSCC).

Methods: We first used histopathological image features and machine-learning algorithms to predict molecular features of 212 HNSCC patients from The Cancer Genome Atlas (TCGA). Next, we divided TCGA-HNSCC cohort into training set (n = 149) and test set (n = 63), and obtained tissue microarrays as an external validation set (n = 126). We identified the gene expression profile correlated to image features by bioinformatics analysis.

Results: Histopathological image features combined with random forest may predict five somatic mutations, transcriptional subtypes, and methylation subtypes, with area under curve (AUC) ranging from 0.828 to 0.968. The prediction model based on image features could predict overall survival, with 5-year AUC of 0.831, 0.782, and 0.751 in training, test, and validation sets. We next established an integrative prognostic model of image features and gene expressions, which obtained better performance in training set (5-year AUC = 0.860) and test set (5-year AUC = 0.826). According to histopathological transcriptomics risk score (HTRS) generated by the model, high-risk and low-risk patients had different survival in training set (HR = 4.09, p < 0.001) and test set (HR=3.08, p = 0.019). Multivariate analysis suggested that HTRS was an independent predictor in training set (HR = 5.17, p < 0.001). The nomogram combining HTRS and clinical factors had higher net benefit than conventional clinical evaluation.

Conclusions: Histopathological image features provided a promising approach to predict mutations, molecular subtypes, and prognosis of HNSCC. The integration of image features and gene expression data had potential for improving prognosis prediction in HNSCC.

Keywords: head and neck cancer; histopathological images; machine learning; transcriptomics.

MeSH terms

  • Area Under Curve
  • Class I Phosphatidylinositol 3-Kinases / genetics
  • DNA Methylation
  • Female
  • Gene Expression Profiling
  • Genes, p16
  • Genes, p53
  • Head and Neck Neoplasms / genetics*
  • Head and Neck Neoplasms / mortality
  • Head and Neck Neoplasms / pathology*
  • Histone-Lysine N-Methyltransferase / genetics
  • Humans
  • Machine Learning
  • Male
  • Middle Aged
  • Mutation
  • Nomograms*
  • Prognosis
  • Receptor, Notch1 / genetics
  • Squamous Cell Carcinoma of Head and Neck / genetics*
  • Squamous Cell Carcinoma of Head and Neck / mortality
  • Squamous Cell Carcinoma of Head and Neck / pathology*
  • Tissue Array Analysis

Substances

  • NOTCH1 protein, human
  • Receptor, Notch1
  • Histone-Lysine N-Methyltransferase
  • NSD1 protein, human
  • Class I Phosphatidylinositol 3-Kinases
  • PIK3CA protein, human