Protein multiple alignments: sequence-based versus structure-based programs
- PMID: 30942864
- DOI: 10.1093/bioinformatics/btz236
Protein multiple alignments: sequence-based versus structure-based programs
Abstract
Motivation: Multiple sequence alignment programs have proved to be very useful and have already been evaluated in the literature yet not alignment programs based on structure or both sequence and structure. In the present article we wish to evaluate the added value provided through considering structures.
Results: We compared the multiple alignments resulting from 25 programs either based on sequence, structure or both, to reference alignments deposited in five databases (BALIBASE 2 and 3, HOMSTRAD, OXBENCH and SISYPHUS). On the whole, the structure-based methods compute more reliable alignments than the sequence-based ones, and even than the sequence+structure-based programs whatever the databases. Two programs lead, MAMMOTH and MATRAS, nevertheless the performances of MUSTANG, MATT, 3DCOMB, TCOFFEE+TM_ALIGN and TCOFFEE+SAP are better for some alignments. The advantage of structure-based methods increases at low levels of sequence identity, or for residues in regular secondary structures or buried ones. Concerning gap management, sequence-based programs set less gaps than structure-based programs. Concerning the databases, the alignments of the manually built databases are more challenging for the programs.
Availability and implementation: All data and results presented in this study are available at: http://wwwabi.snv.jussieu.fr/people/mathilde/download/AliMulComp/.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Similar articles
-
mTM-align: an algorithm for fast and accurate multiple protein structure alignment.Bioinformatics. 2018 May 15;34(10):1719-1725. doi: 10.1093/bioinformatics/btx828. Bioinformatics. 2018. PMID: 29281009 Free PMC article.
-
Iterative refinement of structure-based sequence alignments by Seed Extension.BMC Bioinformatics. 2009 Jul 9;10:210. doi: 10.1186/1471-2105-10-210. BMC Bioinformatics. 2009. PMID: 19589133 Free PMC article.
-
MUSTANG: a multiple structural alignment algorithm.Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921. Proteins. 2006. PMID: 16736488
-
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47. BMC Bioinformatics. 2003. PMID: 14552658 Free PMC article.
-
Formatt: Correcting protein multiple structural alignments by incorporating sequence alignment.BMC Bioinformatics. 2012 Oct 6;13:259. doi: 10.1186/1471-2105-13-259. BMC Bioinformatics. 2012. PMID: 23039758 Free PMC article.
Cited by
-
Calpains as mechanistic drivers and therapeutic targets for ocular disease.Trends Mol Med. 2022 Aug;28(8):644-661. doi: 10.1016/j.molmed.2022.05.007. Epub 2022 May 29. Trends Mol Med. 2022. PMID: 35641420 Free PMC article. Review.
-
Bioinformatic analysis of subfamily-specific regions in 3D-structures of homologs to study functional diversity and conformational plasticity in protein superfamilies.Comput Struct Biotechnol J. 2021 Feb 23;19:1302-1311. doi: 10.1016/j.csbj.2021.02.005. eCollection 2021. Comput Struct Biotechnol J. 2021. PMID: 33738079 Free PMC article.
-
Fermentation optimization of cellulase production from sugarcane bagasse by Bacillus pseudomycoides and molecular modeling study of cellulase.Curr Res Microb Sci. 2020 Nov 27;2:100013. doi: 10.1016/j.crmicr.2020.100013. eCollection 2021 Dec. Curr Res Microb Sci. 2020. PMID: 34841306 Free PMC article.
-
PC_ali: a tool for improved multiple alignments and evolutionary inference based on a hybrid protein sequence and structure similarity score.Bioinformatics. 2023 Nov 1;39(11):btad630. doi: 10.1093/bioinformatics/btad630. Bioinformatics. 2023. PMID: 37847775 Free PMC article.
-
Large-scale structure-informed multiple sequence alignment of proteins with SIMSApiper.Bioinformatics. 2024 May 2;40(5):btae276. doi: 10.1093/bioinformatics/btae276. Bioinformatics. 2024. PMID: 38648741 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous
