MUSCLE: multiple sequence alignment with high accuracy and high throughput
- PMID: 15034147
- PMCID: PMC390337
- DOI: 10.1093/nar/gkh340
Abstract
We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the log-expectation score, and refinement using tree-dependent restricted partitioning. The speed and accuracy of MUSCLE are compared with T-Coffee, MAFFT and CLUSTALW on four test sets of reference alignments: BAliBASE, SABmark, SMART and a new benchmark, PREFAB. MUSCLE achieves the highest, or joint highest, rank in accuracy on each of these sets. Without refinement, MUSCLE achieves average accuracy statistically indistinguishable from T-Coffee and MAFFT, and is the fastest of the tested methods for large numbers of sequences, aligning 5000 sequences of average length 350 in 7 min on a current desktop computer. The MUSCLE program, source code and PREFAB test data are freely available at http://www.drive5.com/muscle.
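To make the idea of distance estimation by kmer counting concrete, here is a minimal Python sketch. It is not MUSCLE's implementation: the abstract does not give the formula, so the similarity measure (shared kmer count divided by the number of kmers in the shorter sequence), the default k = 3, and the function names `kmer_counts` and `kmer_distance` are all assumptions chosen for illustration. The point is only that such a distance can be computed from unaligned sequences, without any pairwise alignment step.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping kmer (substring of length k) in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_distance(a, b, k=3):
    """Illustrative kmer distance between two unaligned sequences.

    Assumption (not taken from the abstract): similarity is the number
    of kmers the sequences share, counted with multiplicity, divided by
    the number of kmers in the shorter sequence; distance = 1 - similarity.
    """
    shortest = min(len(a), len(b))
    if shortest < k:
        raise ValueError("both sequences must contain at least k residues")
    counts_a = kmer_counts(a, k)
    counts_b = kmer_counts(b, k)
    # A kmer seen n times in a and m times in b contributes min(n, m).
    shared = sum(min(n, counts_b[kmer]) for kmer, n in counts_a.items())
    return 1.0 - shared / (shortest - k + 1)


if __name__ == "__main__":
    # Hypothetical toy sequences, for demonstration only.
    seqs = {"s1": "ACDEFGHIK", "s2": "ACDEFGHLK", "s3": "WWYYWWYYW"}
    names = sorted(seqs)
    for i, x in enumerate(names):
        for y in names[i + 1:]:
            print(x, y, round(kmer_distance(seqs[x], seqs[y]), 3))
```

Because each sequence is scanned once to build its kmer table, a full matrix of such distances is cheap to compute compared with aligning every pair, which is the usual motivation for using it to build a guide tree for progressive alignment.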
