Subsystem identification through dimensionality reduction of large-scale gene expression data
- PMID: 12840046
- PMCID: PMC403744
- DOI: 10.1101/gr.903503
Abstract
The availability of parallel, high-throughput biological experiments that simultaneously monitor thousands of cellular observables provides an opportunity to investigate cellular behavior in a highly quantitative manner at multiple levels of resolution. One challenge in exploiting these experimental advances more fully is the need for algorithms that provide an analysis at each of the relevant levels of detail. Here, the data analysis method non-negative matrix factorization (NMF) is applied to the analysis of gene array experiments. Whereas current algorithms identify relationships on the basis of large-scale similarity between expression patterns, NMF is a recently developed machine learning technique capable of recognizing similarity between subportions of the data corresponding to localized features in expression space. A large data set consisting of 300 genome-wide expression measurements of yeast was used as sample data to illustrate the performance of the new approach. The local features detected are shown to map well to functional cellular subsystems. Functional relationships predicted by the new analysis are compared with those predicted using standard approaches; validation using bioinformatic databases suggests that predictions made with the new approach may be up to twice as accurate as those of some conventional approaches.
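As an illustration of the factorization the abstract refers to, the following is a minimal sketch of NMF using the widely known multiplicative update rules for the squared-error objective. It is not the authors' implementation: the function name `nmf`, the rank of 3, the iteration count, and the synthetic genes-by-experiments matrix are all assumptions chosen for illustration, and the sketch assumes the input has already been made non-negative.

```python
import numpy as np

def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix V (genes x experiments) as W @ H.

    Uses multiplicative updates for the Frobenius-norm objective.
    W holds one column per local feature (gene loadings); H holds the
    activity of each feature across experiments.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_expts = V.shape
    W = rng.random((n_genes, rank)) + eps
    H = rng.random((rank, n_expts)) + eps
    errors = []
    for _ in range(n_iter):
        # Element-wise updates keep W and H non-negative throughout.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        errors.append(np.linalg.norm(V - W @ H))
    return W, H, errors

# Synthetic stand-in for expression data (an assumption, not the yeast set):
# 60 genes in three disjoint groups, 30 experiments.
rng = np.random.default_rng(1)
true_W = np.zeros((60, 3))
true_W[0:20, 0] = 1.0
true_W[20:40, 1] = 1.0
true_W[40:60, 2] = 1.0
true_H = rng.random((3, 30))
V = true_W @ true_H + 0.01 * rng.random((60, 30))

W, H, errors = nmf(V, rank=3)

# Genes with the largest loading on a feature are candidate members
# of the same subsystem.
top_genes = np.argsort(W[:, 0])[::-1][:5]
print("reconstruction error:", errors[-1])
print("top genes for feature 0:", top_genes)
```

Because every entry of `W` and `H` stays non-negative, each feature can only add to the reconstruction, which is what tends to produce localized, parts-based features rather than global patterns.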