Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Jul;40(7):854-61.
doi: 10.1038/ng.167. Epub 2008 Jun 15.

Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks

Affiliations

Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks

Jun Zhu et al. Nat Genet. 2008 Jul.

Abstract

A key goal of biology is to construct networks that predict complex system behavior. We combine multiple types of molecular data, including genotypic, expression, transcription factor binding site (TFBS), and protein-protein interaction (PPI) data previously generated from a number of yeast experiments, in order to reconstruct causal gene networks. Networks based on different types of data are compared using metrics devised to assess the predictive power of a network. We show that a network reconstructed by integrating genotypic, TFBS and PPI data is the most predictive. This network is used to predict causal regulators responsible for hot spots of gene expression activity in a segregating yeast population. We also show that the network can elucidate the mechanisms by which causal regulators give rise to larger-scale changes in gene expression activity. We then prospectively validate predictions, providing direct experimental evidence that predictive networks can be constructed by integrating multiple, appropriate data types.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A generic approach to identifying clique-communities in the PPI network. A) Hierarchical clustering over the clique-clique similarity matrix heatmap derived from a network of the 492 k-cliques with k ≥ 5. Cliques in the rows and columns are sorted by an agglomerative hierarchical clustering algorithm. The clique network clearly displays strong modularity under hierarchical organization. Each of the colored bars along the top horizontal and left vertical axes represents a network module. B) The clique community network. Each node represents a clique and each link indicates that the two connected cliques have a similarity greater than 0.5 based on the Dice similarity measure. Of the clique communities represented in this plot, 74% overlap the set of stable protein complexes.
Figure 2
Figure 2
eQTL hot spot 4 subnetworks. A) Subnetworks in BNfull enriched for expression traits linked to eQTL hot spot 4. Of the 3,662 genes comprising BNfull, 203 link to eQTL hot spot 4. There are 309 genes comprising the three subnetworks shown here, and 170 of these genes link to eQTL hot spot 4 (red nodes), a nearly 10-fold enrichment over what was expected by chance (empirical p < 10−8). LEU2 and ILV6 were identified as the primary causal regulators for the large subnetwork, as described in the text. ILV6 is supported as causal for GCN4 in the large subnetwork. B) The ILV6 knockout signature is enriched in the large eQTL hot spot 4 subnetwork. Of the 635 genes in the ILV6 knockout signature, 432 were represented in BNfull, and 129 of these overlapped the large eQTL hot spot 4 subnetwork (colored nodes), a nearly 4-fold enrichment over what was expected by chance (empirical p < 10−8). The red and green nodes represent genes that are up- and down-regulated, respectively, in the ILV6 knockout signature (note GCN4 is up-regulated).

Similar articles

Cited by

References

    1. Kulp DC, Jagalur M. Causal inference of regulator-target pairs by gene mapping of expression phenotypes. BMC Genomics. 2006;7:125. - PMC - PubMed
    1. Lum PY, et al. Elucidating the murine brain transcriptional network in a segregating mouse population to identify core functional modules for obesity and diabetes. J Neurochem. 2006;97 1:50–62. - PubMed
    1. Mehrabian M, et al. Integrating genotypic and expression data in a segregating mouse population to identify 5-lipoxygenase as a susceptibility gene for obesity and bone traits. Nat Genet. 2005;37:1224–33. - PubMed
    1. Schadt EE, et al. An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005;37:710–7. - PMC - PubMed
    1. Rual JF, et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature. 2005;437:1173–8. - PubMed

Publication types

Substances

Associated data