Leveraging existing biological knowledge in the identification of candidate genes for facial dysmorphology

BMC Bioinformatics. 2009 Feb 5;10 Suppl 2(Suppl 2):S12. doi: 10.1186/1471-2105-10-S2-S12.

Abstract

Background: In response to the frequently overwhelming output of high-throughput microarray experiments, we propose a methodology to facilitate interpretation of biological data in the context of existing knowledge. Through the probabilistic integration of explicit and implicit data sources a functional interaction network can be constructed. Each edge connecting two proteins is weighted by a confidence value capturing the strength and reliability of support for that interaction given the combined data sources. The resulting network is examined in conjunction with expression data to identify groups of genes with significant temporal or tissue specific patterns. In contrast to unstructured gene lists, these networks often represent coherent functional groupings.

Results: By linking from shared functional categorizations to primary biological resources we apply this method to craniofacial microarray data, generating biologically testable hypotheses and identifying candidate genes for craniofacial development.

Conclusion: The novel methodology presented here illustrates how the effective integration of pre-existing biological knowledge and high-throughput experimental data drives biological discovery and hypothesis generation.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Craniofacial Abnormalities / genetics*
  • Craniofacial Abnormalities / metabolism
  • Databases, Genetic
  • Gene Expression Profiling / methods
  • Gene Regulatory Networks*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Pattern Recognition, Automated