pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination

Anna Lobley; Michael I Sadowski; David T Jones

doi:10.1093/bioinformatics/btp302

pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination

Bioinformatics. 2009 Jul 15;25(14):1761-7. doi: 10.1093/bioinformatics/btp302. Epub 2009 May 7.

Authors

Anna Lobley¹, Michael I Sadowski, David T Jones

Affiliation

¹ Department of Computer Science, University College London, UK.

PMID: 19429599
DOI: 10.1093/bioinformatics/btp302

Abstract

Motivation: Generation of structural models and recognition of homologous relationships for unannotated protein sequences are fundamental problems in bioinformatics. Improving the sensitivity and selectivity of methods designed for these two tasks therefore has downstream benefits for many other bioinformatics applications.

Results: We describe the latest implementation of the GenTHREADER method for structure prediction on a genomic scale. The method combines profile-profile alignments with secondary-structure specific gap-penalties, classic pair- and solvation potentials using a linear combination optimized with a regression SVM model. We find this combination significantly improves both detection of useful templates and accuracy of sequence-structure alignments relative to other competitive approaches. We further present a second implementation of the protocol designed for the task of discriminating superfamilies from one another. This method, pDomTHREADER, is the first to incorporate both sequence and structural data directly in this task and improves sensitivity and selectivity over the standard version of pGenTHREADER and three other standard methods for remote homology detection.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods*
Protein Folding
Protein Structure, Tertiary*
Proteins / chemistry
Proteins / classification*
Sequence Analysis, Protein / methods
Software*

Substances

Proteins

Grants and funding

Biotechnology and Biological Sciences Research Council/United Kingdom