Increasing the power to detect causal associations by combining genotypic and expression data in segregating populations
- PMID: 17432931
- PMCID: PMC1851982
- DOI: 10.1371/journal.pcbi.0030069
Increasing the power to detect causal associations by combining genotypic and expression data in segregating populations
Abstract
To dissect common human diseases such as obesity and diabetes, a systematic approach is needed to study how genes interact with one another, and with genetic and environmental factors, to determine clinical end points or disease phenotypes. Bayesian networks provide a convenient framework for extracting relationships from noisy data and are frequently applied to large-scale data to derive causal relationships among variables of interest. Given the complexity of molecular networks underlying common human disease traits, and the fact that biological networks can change depending on environmental conditions and genetic factors, large datasets, generally involving multiple perturbations (experiments), are required to reconstruct and reliably extract information from these networks. With limited resources, the balance of coverage of multiple perturbations and multiple subjects in a single perturbation needs to be considered in the experimental design. Increasing the number of experiments, or the number of subjects in an experiment, is an expensive and time-consuming way to improve network reconstruction. Integrating multiple types of data from existing subjects might be more efficient. For example, it has recently been demonstrated that combining genotypic and gene expression data in a segregating population leads to improved network reconstruction, which in turn may lead to better predictions of the effects of experimental perturbations on any given gene. Here we simulate data based on networks reconstructed from biological data collected in a segregating mouse population and quantify the improvement in network reconstruction achieved using genotypic and gene expression data, compared with reconstruction using gene expression data alone. We demonstrate that networks reconstructed using the combined genotypic and gene expression data achieve a level of reconstruction accuracy that exceeds networks reconstructed from expression data alone, and that fewer subjects may be required to achieve this superior reconstruction accuracy. We conclude that this integrative genomics approach to reconstructing networks not only leads to more predictive network models, but also may save time and money by decreasing the amount of data that must be generated under any given condition of interest to construct predictive network models.
Conflict of interest statement
Figures
Similar articles
-
Alternative pathway approach for automating analysis and validation of cell perturbation networks and design of perturbation experiments.Ann N Y Acad Sci. 2007 Dec;1115:267-85. doi: 10.1196/annals.1407.011. Epub 2007 Oct 9. Ann N Y Acad Sci. 2007. PMID: 17925355
-
An integrative genomics approach to the reconstruction of gene networks in segregating populations.Cytogenet Genome Res. 2004;105(2-4):363-74. doi: 10.1159/000078209. Cytogenet Genome Res. 2004. PMID: 15237224
-
Fitting a geometric graph to a protein-protein interaction network.Bioinformatics. 2008 Apr 15;24(8):1093-9. doi: 10.1093/bioinformatics/btn079. Epub 2008 Mar 14. Bioinformatics. 2008. PMID: 18344248
-
Exploiting naturally occurring DNA variation and molecular profiling data to dissect disease and drug response traits.Curr Opin Biotechnol. 2005 Dec;16(6):647-54. doi: 10.1016/j.copbio.2005.10.005. Epub 2005 Oct 24. Curr Opin Biotechnol. 2005. PMID: 16249078 Review.
-
Reconstruction of microbial transcriptional regulatory networks.Curr Opin Biotechnol. 2004 Feb;15(1):70-7. doi: 10.1016/j.copbio.2003.11.002. Curr Opin Biotechnol. 2004. PMID: 15102470 Review.
Cited by
-
Critical reasoning on causal inference in genome-wide linkage and association studies.Trends Genet. 2010 Dec;26(12):493-8. doi: 10.1016/j.tig.2010.09.002. Epub 2010 Oct 15. Trends Genet. 2010. PMID: 20951462 Free PMC article.
-
Cross-species systems analysis identifies gene networks differentially altered by sleep loss and depression.Sci Adv. 2018 Jul 25;4(7):eaat1294. doi: 10.1126/sciadv.aat1294. eCollection 2018 Jul. Sci Adv. 2018. PMID: 30050989 Free PMC article.
-
Using gene expression to investigate the genetic basis of complex disorders.Hum Mol Genet. 2008 Oct 15;17(R2):R129-34. doi: 10.1093/hmg/ddn285. Hum Mol Genet. 2008. PMID: 18852201 Free PMC article. Review.
-
Regulation of cell distancing in peri-plaque glial nets by Plexin-B1 affects glial activation and amyloid compaction in Alzheimer's disease.Nat Neurosci. 2024 Aug;27(8):1489-1504. doi: 10.1038/s41593-024-01664-w. Epub 2024 May 27. Nat Neurosci. 2024. PMID: 38802590
-
Network Preservation Analysis Reveals Dysregulated Metabolic Pathways in Human Vascular Smooth Muscle Cell Phenotypic Switching.Circ Genom Precis Med. 2023 Aug;16(4):372-381. doi: 10.1161/CIRCGEN.122.003781. Epub 2023 Jun 30. Circ Genom Precis Med. 2023. PMID: 37387208 Free PMC article.
References
-
- Pearl J. Probabilistic reasoning in intelligent systems: Networks of plausible inference. San Mateo (California): Morgan Kaufmann Publishers. p. xix; 1988. 552
-
- Pe'er D, Regev A, Elidan G, Friedman N. Inferring subnetworks from perturbed expression profiles. Bioinformatics. 2001;17(Supplement 1):S215–S224. - PubMed
-
- Sachs K, Perez O, Pe'er D, Lauffenburger DA, Nolan GP. Causal protein-signaling networks derived from multiparameter single-cell data. Science. 2005;308:523–529. - PubMed
-
- Zhu J, Lum PY, Lamb J, GuhaThakurta D, Edwards SW, et al. An integrative genomics approach to the reconstruction of gene networks in segregating populations. Cytogenet Genome Res. 2004;105:363–374. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
