Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks

Bioinformatics. 2008 Nov 1;24(21):2491-7. doi: 10.1093/bioinformatics/btn482. Epub 2008 Sep 10.

Abstract

Motivation: We present PCIT, an algorithm for the reconstruction of gene co-expression networks (GCN) that combines the concept partial correlation coefficient with information theory to identify significant gene to gene associations defining edges in the reconstruction of GCN. The properties of PCIT are examined in the context of the topology of the reconstructed network including connectivity structure, clustering coefficient and sensitivity.

Results: We apply PCIT to a series of simulated datasets with varying levels of complexity in terms of number of genes and experimental conditions, as well as to three real datasets. Results show that, as opposed to the constant cutoff approach commonly used in the literature, the PCIT algorithm can identify and allow for more moderate, yet not less significant, estimates of correlation (r) to still establish a connection in the GCN. We show that PCIT is more sensitive than established methods and capable of detecting functionally validated gene-gene interactions coming from absolute r values as low as 0.3. These bona fide associations, which often relate to genes with low variation in expression patterns, are beyond the detection limits of conventional fixed-threshold methods, and would be overlooked by studies relying on those methods.

Availability: FORTRAN 90 source code to perform the PCIT algorithm is available as Supplementary File 1.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Cattle
  • Computer Simulation
  • Databases, Genetic
  • Gene Expression Regulation*
  • Gene Regulatory Networks*
  • Humans
  • Information Theory
  • Oligonucleotide Array Sequence Analysis