Models for Predicting Stage in Head and Neck Squamous Cell Carcinoma Using Proteomic and Transcriptomic Data

IEEE J Biomed Health Inform. 2017 Jan;21(1):246-253. doi: 10.1109/JBHI.2015.2489158. Epub 2015 Oct 8.

Abstract

Late diagnosis is one of the reasons that head and neck squamous cell carcinoma (HNSCC) patients experience relative five-year survival rates ranging from 40%-66%. The molecular-level differences between early and advanced stage HNSCC may provide insight into therapeutic targets and strategies. Previous bioinformatics studies have shown mixed or limited results in identifying gene and protein markers and in developing models for discriminating between early and advanced stage HNSCC. Thus, we have investigated models for HNSCC stage prediction using RNAseq and reverse phase protein array data from The Cancer Genome Atlas and The Cancer Proteome Atlas. We systematically assessed individual and ensemble binary classifiers, using filter and wrapper feature selection methods, to develop several well-performing models. In particular, integrated models harnessing both data types consistently resulted in better performance. This study identifies informative protein and gene feature sets which may increase understanding of HNSCC progression.

MeSH terms

  • Carcinoma, Squamous Cell / diagnosis*
  • Carcinoma, Squamous Cell / genetics*
  • Carcinoma, Squamous Cell / metabolism
  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Head and Neck Neoplasms / diagnosis*
  • Head and Neck Neoplasms / genetics*
  • Head and Neck Neoplasms / metabolism
  • Humans
  • Models, Statistical
  • Proteome / analysis
  • Proteome / genetics*
  • Proteome / metabolism
  • Sequence Analysis, RNA
  • Squamous Cell Carcinoma of Head and Neck
  • Support Vector Machine
  • Transcriptome / genetics*

Substances

  • Proteome