Novel algorithm for coexpression detection in time-varying microarray data sets

IEEE/ACM Trans Comput Biol Bioinform. 2008 Jan-Mar;5(1):120-135. doi: 10.1109/tcbb.2007.1052.

Abstract

When analyzing the results of microarray experiments, biologists generally use unsupervised categorization tools. However, such tools regard each time point as an independent dimension and utilize the Euclidean distance to compute the similarities between expressions. Furthermore, some of these methods require the number of clusters to be determined in advance, which is clearly impossible in the case of a new dataset. Therefore, this study proposes a novel scheme, designated as the Variation-based Coexpression Detection (VCD) algorithm, to analyze the trends of expressions based on their variation over time. The proposed algorithm has two advantages. First, it is unnecessary to determine the number of clusters in advance since the algorithm automatically detects those genes whose profiles are grouped together and creates patterns for these groups. Second, the algorithm features a new measurement criterion for calculating the degree of change of the expressions between adjacent time points and evaluating their trend similarities. Three real-world microarray datasets are employed to evaluate the performance of the proposed algorithm.

MeSH terms

  • Algorithms*
  • Animals
  • Blastocystis hominis / genetics
  • Cluster Analysis
  • Computational Biology / methods
  • Gene Expression Profiling / methods*
  • HT29 Cells
  • Humans
  • Oligonucleotide Array Sequence Analysis / methods*
  • Pattern Recognition, Automated / methods*
  • Protozoan Proteins / genetics
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae Proteins / genetics
  • Schizosaccharomyces / genetics
  • Schizosaccharomyces pombe Proteins / genetics

Substances

  • Protozoan Proteins
  • Saccharomyces cerevisiae Proteins
  • Schizosaccharomyces pombe Proteins