The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins
- PMID: 15769473
- DOI: 10.1016/j.jmb.2005.01.071
The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins
Abstract
The structural stability of a protein requires a large number of interresidue interactions. The energetic contribution of these can be approximated by low-resolution force fields extracted from known structures, based on observed amino acid pairing frequencies. The summation of such energies, however, cannot be carried out for proteins whose structure is not known or for intrinsically unstructured proteins. To overcome these limitations, we present a novel method for estimating the total pairwise interaction energy, based on a quadratic form in the amino acid composition of the protein. This approach is validated by the good correlation of the estimated and actual energies of proteins of known structure and by a clear separation of folded and disordered proteins in the energy space it defines. As the novel algorithm has not been trained on unstructured proteins, it substantiates the concept of protein disorder, i.e. that the inability to form a well-defined 3D structure is an intrinsic property of many proteins and protein domains. This property is encoded in their sequence, because their biased amino acid composition does not allow sufficient stabilizing interactions to form. By limiting the calculation to a predefined sequential neighborhood, the algorithm was turned into a position-specific scoring scheme that characterizes the tendency of a given amino acid to fall into an ordered or disordered region. This application we term IUPred and compare its performance with three generally accepted predictors, PONDR VL3H, DISOPRED2 and GlobPlot on a database of disordered proteins.
Similar articles
-
IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content.Bioinformatics. 2005 Aug 15;21(16):3433-4. doi: 10.1093/bioinformatics/bti541. Epub 2005 Jun 14. Bioinformatics. 2005. PMID: 15955779
-
Novel knowledge-based mean force potential at atomic level.J Mol Biol. 1997 Mar 21;267(1):207-22. doi: 10.1006/jmbi.1996.0868. J Mol Biol. 1997. PMID: 9096219
-
Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space.Proteins. 2009 May 15;75(3):682-705. doi: 10.1002/prot.22280. Proteins. 2009. PMID: 19003998
-
[Structured proteins and proteins with internal disorder].Mol Biol (Mosk). 2007 Mar-Apr;41(2):297-313. Mol Biol (Mosk). 2007. PMID: 17514898 Review. Russian.
-
The most important thing is the tail: multitudinous functionalities of intrinsically disordered protein termini.FEBS Lett. 2013 Jun 27;587(13):1891-901. doi: 10.1016/j.febslet.2013.04.042. Epub 2013 May 10. FEBS Lett. 2013. PMID: 23665034 Review.
Cited by
-
Computational identification of MoRFs in protein sequences.Bioinformatics. 2015 Jun 1;31(11):1738-44. doi: 10.1093/bioinformatics/btv060. Epub 2015 Jan 30. Bioinformatics. 2015. PMID: 25637562 Free PMC article.
-
The Pathophysiological Significance of Fibulin-3.Biomolecules. 2020 Sep 8;10(9):1294. doi: 10.3390/biom10091294. Biomolecules. 2020. PMID: 32911658 Free PMC article. Review.
-
Structural architecture of the human long non-coding RNA, steroid receptor RNA activator.Nucleic Acids Res. 2012 Jun;40(11):5034-51. doi: 10.1093/nar/gks071. Epub 2012 Feb 22. Nucleic Acids Res. 2012. PMID: 22362738 Free PMC article.
-
De Novo Regulatory Motif Discovery Identifies Significant Motifs in Promoters of Five Classes of Plant Dehydrin Genes.PLoS One. 2015 Jun 26;10(6):e0129016. doi: 10.1371/journal.pone.0129016. eCollection 2015. PLoS One. 2015. PMID: 26114291 Free PMC article.
-
Single-residue posttranslational modification sites at the N-terminus, C-terminus or in-between: To be or not to be exposed for enzyme access.Proteomics. 2015 Jul;15(14):2525-46. doi: 10.1002/pmic.201400633. Proteomics. 2015. PMID: 26038108 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
