Cascaded multiple classifiers for secondary structure prediction
- PMID: 10892809
- PMCID: PMC2144653
- DOI: 10.1110/ps.9.6.1162
Abstract
We describe a new classifier for protein secondary structure prediction, formed by cascading different types of classifiers built from neural networks and linear discrimination. The new classifier achieves an accuracy of 76.7%, assessed by a rigorous full jack-knife procedure, on a new nonredundant dataset of 496 nonhomologous sequences (obtained from G.J. Barton and J.A. Cuff). This database was designed specifically for training and testing protein secondary structure prediction methods, and it uses a more stringent definition of homologous sequence than previous studies. We show that it is possible to design classifiers that discriminate well among the three classes (H, E, C), with an accuracy of up to 78% for beta-strands, using only a local window and resampling techniques. This indicates that the importance of long-range interactions for the prediction of beta-strands has probably been overestimated in previous work.
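The ideas named in the abstract (a local window, a cascade in which a second stage reads the outputs of the first, and a full jack-knife assessment) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' method: the paper's stages are neural networks and linear discriminants, whereas here a simple counting scorer and a score-summing step stand in for them. All function names, the padding character, and the window sizes are hypothetical.

```python
from collections import Counter, defaultdict

CLASSES = "HEC"  # helix, strand, coil


def windows(seq, half_width, pad="-"):
    """Return one fixed-size local window per residue, padding both ends."""
    padded = pad * half_width + seq + pad * half_width
    size = 2 * half_width + 1
    return [padded[i:i + size] for i in range(len(seq))]


def train_stage1(examples, half_width):
    """Stand-in first stage: count the classes seen for each (offset, residue)."""
    counts = defaultdict(Counter)
    for seq, labels in examples:
        for win, label in zip(windows(seq, half_width), labels):
            for offset, residue in enumerate(win):
                counts[(offset, residue)][label] += 1
    return counts


def predict_stage1(counts, seq, half_width):
    """Per-residue class scores: counts summed over the window positions."""
    scores = []
    for win in windows(seq, half_width):
        total = Counter()
        for offset, residue in enumerate(win):
            total.update(counts.get((offset, residue), {}))
        scores.append([total[c] for c in CLASSES])
    return scores


def predict_stage2(scores, half_width):
    """Stand-in second stage: combine stage-1 scores of neighbouring residues."""
    out = []
    for i in range(len(scores)):
        lo, hi = max(0, i - half_width), min(len(scores), i + half_width + 1)
        summed = [sum(s[k] for s in scores[lo:hi]) for k in range(len(CLASSES))]
        out.append(CLASSES[summed.index(max(summed))])
    return "".join(out)


def jackknife_q3(dataset, half_width=2):
    """Full jack-knife: hold out each sequence in turn, train on all the others,
    and report the fraction of residues predicted correctly (three-state accuracy)."""
    correct = total = 0
    for i, (seq, labels) in enumerate(dataset):
        training = dataset[:i] + dataset[i + 1:]
        model = train_stage1(training, half_width)
        predicted = predict_stage2(predict_stage1(model, seq, half_width), 1)
        correct += sum(p == t for p, t in zip(predicted, labels))
        total += len(labels)
    return correct / total
```

Calling `jackknife_q3` on a list of `(sequence, labels)` pairs, where `labels` is a string over `H`, `E`, and `C` of the same length as the sequence, returns a value between 0 and 1. Retraining once per held-out sequence is what makes the jack-knife "full", and it is also why the procedure is costly on a dataset of several hundred sequences.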
Similar articles
- Prediction of protein secondary structure by neural networks: encoding short and long range patterns of amino acid packing. Acta Biochim Pol. 1992;39(4):369-92. PMID: 1293893
- Combining the GOR V algorithm with evolutionary information for protein secondary structure prediction from amino acid sequence. Proteins. 2002 Nov 1;49(2):154-66. doi: 10.1002/prot.10181. PMID: 12210997
- Improving prediction of protein secondary structure using structured neural networks and multiple sequence alignments. J Comput Biol. 1996 Spring;3(1):163-83. doi: 10.1089/cmb.1996.3.163. PMID: 8697234
- Prediction of turn types in protein structure by machine-learning classifiers. Proteins. 2009 Feb 1;74(2):344-52. doi: 10.1002/prot.22164. PMID: 18618702
- Protein secondary structure prediction with SPARROW. J Chem Inf Model. 2012 Feb 27;52(2):545-56. doi: 10.1021/ci200321u. Epub 2012 Jan 23. PMID: 22224407
Cited by
- Screening of potential pseudo att sites of Streptomyces phage ΦC31 integrase in the human genome. Acta Pharmacol Sin. 2013 Apr;34(4):561-9. doi: 10.1038/aps.2012.173. Epub 2013 Feb 18. PMID: 23416928. Free PMC article.
- Constraint-based, homology model of the extracellular domain of the epithelial Na+ channel α subunit reveals a mechanism of channel activation by proteases. J Biol Chem. 2011 Jan 7;286(1):649-60. doi: 10.1074/jbc.M110.167098. Epub 2010 Oct 25. PMID: 20974852. Free PMC article.
- How many 3D structures do we need to train a predictor? Genomics Proteomics Bioinformatics. 2009 Sep;7(3):128-37. doi: 10.1016/S1672-0229(08)60041-8. PMID: 19944385. Free PMC article.
- The PN2-3 domain of centrosomal P4.1-associated protein implements a novel mechanism for tubulin sequestration. J Biol Chem. 2009 Mar 13;284(11):6909-17. doi: 10.1074/jbc.M808249200. Epub 2009 Jan 7. PMID: 19131341. Free PMC article.
- Structure and size determination of bacteriophage P2 and P4 procapsids: function of size responsiveness mutations. J Struct Biol. 2012 Jun;178(3):215-24. doi: 10.1016/j.jsb.2012.04.002. Epub 2012 Apr 9. PMID: 22508104. Free PMC article.
