Data mining methods in Omics-based biomarker discovery

Methods Mol Biol. 2011:719:511-26. doi: 10.1007/978-1-61779-027-0_24.


The advent of Omics technologies as genomics and proteomics has brought the hope of discovering novel biomarkers that can be used to diagnose, predict, and monitor the progress of disease. The importance of data mining to identify biological markers for the diagnostic classification and prognostic assessment in the context of microarray and proteomic data has been increasingly recognized. We present an overview of general data mining methods and their applications to biomarker discovery with particular focus on genomics and proteomics data. Two case studies are exemplarily presented, and relevant data mining terminology and techniques are explained.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Biomarkers / metabolism
  • Chromatography, Liquid
  • Colorectal Neoplasms / genetics
  • Colorectal Neoplasms / metabolism
  • Computational Biology / methods*
  • Data Display
  • Data Mining / methods*
  • Decision Trees
  • Humans
  • Neural Networks, Computer
  • Protein Interaction Mapping
  • Proteomics
  • Tandem Mass Spectrometry


  • Biomarkers