TM-align: a protein structure alignment algorithm based on the TM-score
- PMID: 15849316
- PMCID: PMC1084323
- DOI: 10.1093/nar/gki524
TM-align: a protein structure alignment algorithm based on the TM-score
Abstract
We have developed TM-align, a new algorithm to identify the best structural alignment between protein pairs that combines the TM-score rotation matrix and Dynamic Programming (DP). The algorithm is approximately 4 times faster than CE and 20 times faster than DALI and SAL. On average, the resulting structure alignments have higher accuracy and coverage than those provided by these most often-used methods. TM-align is applied to an all-against-all structure comparison of 10 515 representative protein chains from the Protein Data Bank (PDB) with a sequence identity cutoff <95%: 1996 distinct folds are found when a TM-score threshold of 0.5 is used. We also use TM-align to match the models predicted by TASSER for solved non-homologous proteins in PDB. For both folded and misfolded models, TM-align can almost always find close structural analogs, with an average root mean square deviation, RMSD, of 3 A and 87% alignment coverage. Nevertheless, there exists a significant correlation between the correctness of the predicted structure and the structural similarity of the model to the other proteins in the PDB. This correlation could be used to assist in model selection in blind protein structure predictions. The TM-align program is freely downloadable at http://bioinformatics.buffalo.edu/TM-align.
Figures
Similar articles
-
Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531. BMC Bioinformatics. 2008. PMID: 19077267 Free PMC article.
-
Scoring function for automated assessment of protein structure template quality.Proteins. 2004 Dec 1;57(4):702-10. doi: 10.1002/prot.20264. Proteins. 2004. PMID: 15476259
-
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015. PLoS One. 2015. PMID: 26502070 Free PMC article.
-
Comparison of proteins based on segments structural similarity.Acta Biochim Pol. 2004;51(1):161-72. Acta Biochim Pol. 2004. PMID: 15094837 Review.
-
A model for statistical significance of local similarities in structure.J Mol Biol. 2003 Mar 7;326(5):1307-16. doi: 10.1016/s0022-2836(03)00045-7. J Mol Biol. 2003. PMID: 12595245 Review.
Cited by
-
Diversity, Distribution and Structural Prediction of the Pathogenic Bacterial Effectors EspN and EspS.Genes (Basel). 2024 Sep 26;15(10):1250. doi: 10.3390/genes15101250. Genes (Basel). 2024. PMID: 39457374 Free PMC article.
-
Multiobjective heuristic algorithm for de novo protein design in a quantified continuous sequence space.Comput Struct Biotechnol J. 2021 Apr 25;19:2575-2587. doi: 10.1016/j.csbj.2021.04.046. eCollection 2021. Comput Struct Biotechnol J. 2021. PMID: 34025944 Free PMC article.
-
Phenotype-specific adverse effects of XPD mutations on human prenatal development implicate impairment of TFIIH-mediated functions in placenta.Eur J Hum Genet. 2012 Jun;20(6):626-31. doi: 10.1038/ejhg.2011.249. Epub 2012 Jan 11. Eur J Hum Genet. 2012. PMID: 22234153 Free PMC article.
-
Improved protein complex prediction with AlphaFold-multimer by denoising the MSA profile.PLoS Comput Biol. 2024 Jul 25;20(7):e1012253. doi: 10.1371/journal.pcbi.1012253. eCollection 2024 Jul. PLoS Comput Biol. 2024. PMID: 39052676 Free PMC article.
-
How many protein-protein interactions types exist in nature?PLoS One. 2012;7(6):e38913. doi: 10.1371/journal.pone.0038913. Epub 2012 Jun 13. PLoS One. 2012. PMID: 22719985 Free PMC article.
References
-
- Murzin A.G., Brenner S.E., Hubbard T., Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 1995;247:536–540. - PubMed
-
- Orengo C.A., Michie A.D., Jones S., Jones D.T., Swindells M.B., Thornton J.M. CATH—a hierarchic classification of protein domain structures. Structure. 1997;5:1093–1108. - PubMed
-
- Moult J., Fidelis K., Zemla A., Hubbard T. Critical assessment of methods of protein structure prediction (CASP)-round V. Proteins. 2003;53:334–339. - PubMed
-
- Skolnick J., Fetrow J.S., Kolinski A. Structural genomics and its importance for gene function analysis. Nat. Biotechnol. 2000;18:283–287. - PubMed
-
- Baker D., Sali A. Protein structure prediction and structural genomics. Science. 2001;294:93–96. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
