Finding gene clusters for a replicated time course study

BMC Res Notes. 2014 Jan 24:7:60. doi: 10.1186/1756-0500-7-60.

Abstract

Background: Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies.

Findings: In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast.

Conclusions: The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Cell Cycle / genetics
  • Cell Cycle Proteins / deficiency
  • Cell Cycle Proteins / genetics
  • Cluster Analysis
  • Gene Expression Profiling / methods
  • Gene Expression Profiling / statistics & numerical data*
  • Gene Expression Regulation, Fungal
  • Gene Knockout Techniques
  • Genes, Fungal
  • Homeodomain Proteins / genetics
  • Multigene Family*
  • Oligonucleotide Array Sequence Analysis / methods
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Pattern Recognition, Automated
  • Regression Analysis
  • Repressor Proteins / deficiency
  • Repressor Proteins / genetics
  • Saccharomyces cerevisiae Proteins / genetics
  • Time Factors

Substances

  • Cell Cycle Proteins
  • Homeodomain Proteins
  • Repressor Proteins
  • Saccharomyces cerevisiae Proteins
  • Yox1 protein, S cerevisiae