Correlated sequence-signatures as markers of protein-protein interaction
- PMID: 11518523
- DOI: 10.1006/jmbi.2001.4920
Abstract
As protein-protein interaction is intrinsic to most cellular processes, the ability to predict which proteins in the cell interact can aid significantly in identifying the function of newly discovered proteins and in understanding the molecular networks in which they participate. Here we demonstrate that characteristic pairs of sequence-signatures can be learned from a database of experimentally determined interacting proteins, where one protein contains one sequence-signature and its interacting partner contains the other. The sequence-signatures that recur in concert in various pairs of interacting proteins are termed correlated sequence-signatures, and it is proposed that they can be used for predicting putative pairs of interacting partners in the cell. We demonstrate the potential of this approach on a comprehensive database of experimentally determined pairs of interacting proteins in the yeast Saccharomyces cerevisiae. The proteins in this database have been characterized by their sequence-signatures, as defined by the InterPro classification. A statistical analysis performed on all possible combinations of sequence-signature pairs has identified those pairs that are over-represented in the database of yeast interacting proteins. We show how using the correlated sequence-signatures as identifiers of interacting proteins can significantly reduce the search space and enable directed experimental interaction screens.
Copyright 2001 Academic Press.
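The over-representation analysis described in the abstract can be illustrated with a minimal sketch. The abstract does not state which statistic was used, so the log-odds score below is an assumption chosen for illustration, and the function name `score_signature_pairs`, its parameters, and the input layout (a list of interacting protein pairs plus a mapping from protein to its set of signature IDs) are all hypothetical.

```python
from collections import Counter
from itertools import product
from math import log2


def score_signature_pairs(interactions, signatures):
    """Score signature pairs by log-odds of co-occurrence in interacting pairs.

    interactions: iterable of (protein_a, protein_b) tuples.
    signatures:   dict mapping protein ID -> set of signature IDs.
    Returns {(sig_i, sig_j): (log_odds, oriented_count)} with sig_i <= sig_j.

    ASSUMPTION: log2(observed / expected) is an illustrative statistic,
    not necessarily the one used in the paper.
    """
    pair_counts = Counter()  # oriented (left_sig, right_sig) co-occurrences
    sig_counts = Counter()   # how often each signature appears on the left
    n_oriented = 0           # each interaction is counted in both orientations

    for a, b in interactions:
        sigs_a = signatures.get(a, set())
        sigs_b = signatures.get(b, set())
        if not sigs_a or not sigs_b:
            continue  # skip proteins without any annotated signature
        for left, right in ((sigs_a, sigs_b), (sigs_b, sigs_a)):
            n_oriented += 1
            pair_counts.update(product(left, right))
            sig_counts.update(left)

    scores = {}
    for (s, t), count in pair_counts.items():
        if s > t:
            continue  # counts are symmetric; keep one key per unordered pair
        observed = count / n_oriented
        # Null model: the two partners carry their signatures independently.
        expected = (sig_counts[s] / n_oriented) * (sig_counts[t] / n_oriented)
        scores[(s, t)] = (log2(observed / expected), count)
    return scores
```

Pairs with a large positive score co-occur in interacting proteins more often than the independence assumption predicts. On real data, a minimum-count filter or a significance test would be needed before treating such pairs as correlated, since pairs seen only once or twice can score highly by chance.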
Similar articles
- Large scale statistical prediction of protein-protein interaction by potentially interacting domain (PID) pair. Genome Inform. 2002;13:42-50. PMID: 14571373
- Automated discovery of structural signatures of protein fold and function. J Mol Biol. 2001 Feb 23;306(3):591-605. doi: 10.1006/jmbi.2000.4414. PMID: 11178916
- Functional grouping based on signatures in protein termini. Proteins. 2006 Jun 1;63(4):996-1004. doi: 10.1002/prot.20903. PMID: 16475191
- Computational prediction of protein-protein interactions. Methods Mol Biol. 2004;261:445-68. doi: 10.1385/1-59259-762-9:445. PMID: 15064475. Review.
- Protein structure databases with new web services for structural biology and biomedical research. Brief Bioinform. 2008 Jul;9(4):276-85. doi: 10.1093/bib/bbn015. Epub 2008 Apr 22. PMID: 18430752. Review.
Cited by
- Characterization and prediction of protein-protein interactions within and between complexes. Proc Natl Acad Sci U S A. 2006 Oct 3;103(40):14718-23. doi: 10.1073/pnas.0603352103. Epub 2006 Sep 26. PMID: 17003128. Free PMC article.
- A domain-based approach to predict protein-protein interactions. BMC Bioinformatics. 2007 Jun 13;8:199. doi: 10.1186/1471-2105-8-199. PMID: 17567909. Free PMC article.
- Host pathogen protein interactions predicted by comparative modeling. Protein Sci. 2007 Dec;16(12):2585-96. doi: 10.1110/ps.073228407. Epub 2007 Oct 26. PMID: 17965183. Free PMC article.
- False positive reduction in protein-protein interaction predictions using gene ontology annotations. BMC Bioinformatics. 2007 Jul 23;8:262. doi: 10.1186/1471-2105-8-262. PMID: 17645798. Free PMC article.
- PreSPI: a domain combination based prediction system for protein-protein interaction. Nucleic Acids Res. 2004 Dec 1;32(21):6312-20. doi: 10.1093/nar/gkh972. Print 2004. PMID: 15576357. Free PMC article.
