The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection
- PMID: 20447933
- DOI: 10.1093/molbev/msq115
The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection
Abstract
The detection of positive Darwinian selection affecting protein-coding genes remains a topic of great interest and importance. The "branch-site" test is designed to detect localized episodic bouts of positive selection that affect only a few amino acid residues on particular lineages and has been shown to have reasonable power and low false-positive rates for a wide range of selection schemes. Previous simulations examining the performance of the test, however, were conducted under idealized conditions without insertions, deletions, or alignment errors. As the test is sometimes used to analyze divergent sequences, the impact of indels and alignment errors is a major concern. Here, we used a recently developed indel-simulation program to examine the false-positive rate and power of the branch-site test. We find that insertions and deletions do not cause excessive false positives if the alignment is correct, but alignment errors can lead to unacceptably high false positives. Of the alignment methods evaluated, PRANK consistently outperformed MUSCLE, MAFFT, and ClustalW, mostly because the latter programs tend to place nonhomologous codons (or amino acids) into the same column, producing shorter and less accurate alignments and giving the false impression that many amino acid substitutions have occurred at those sites. Our examination of two previous studies suggests that alignment errors may impact the analysis of mammalian and vertebrate genes by the branch-site test, and it is important to use reliable alignment methods.
Similar articles
-
Statistical properties of the branch-site test of positive selection.Mol Biol Evol. 2011 Mar;28(3):1217-28. doi: 10.1093/molbev/msq303. Epub 2010 Nov 18. Mol Biol Evol. 2011. PMID: 21087944
-
The effects of alignment error and alignment filtering on the sitewise detection of positive selection.Mol Biol Evol. 2012 Apr;29(4):1125-39. doi: 10.1093/molbev/msr272. Epub 2011 Nov 1. Mol Biol Evol. 2012. PMID: 22049066
-
Phylogeny-aware gap placement prevents errors in sequence alignment and evolutionary analysis.Science. 2008 Jun 20;320(5883):1632-5. doi: 10.1126/science.1158395. Science. 2008. PMID: 18566285
-
Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites.Mol Biol Evol. 2007 May;24(5):1219-28. doi: 10.1093/molbev/msm042. Epub 2007 Mar 5. Mol Biol Evol. 2007. PMID: 17339634
-
Small insertions and deletions (INDELs) in human genomes.Hum Mol Genet. 2010 Oct 15;19(R2):R131-6. doi: 10.1093/hmg/ddq400. Epub 2010 Sep 21. Hum Mol Genet. 2010. PMID: 20858594 Free PMC article. Review.
Cited by
-
Positive Selection in Gene Regulatory Factors Suggests Adaptive Pleiotropic Changes During Human Evolution.Front Genet. 2021 May 17;12:662239. doi: 10.3389/fgene.2021.662239. eCollection 2021. Front Genet. 2021. PMID: 34079582 Free PMC article.
-
Degeneration of the nonrecombining regions in the mating-type chromosomes of the anther-smut fungi.Mol Biol Evol. 2015 Apr;32(4):928-43. doi: 10.1093/molbev/msu396. Epub 2014 Dec 21. Mol Biol Evol. 2015. PMID: 25534033 Free PMC article.
-
Complete chloroplast genomes of eight Delphinium taxa (Ranunculaceae) endemic to Xinjiang, China: insights into genome structure, comparative analysis, and phylogenetic relationships.BMC Plant Biol. 2024 Jun 26;24(1):600. doi: 10.1186/s12870-024-05279-y. BMC Plant Biol. 2024. PMID: 38926811 Free PMC article.
-
Sequence shortening in the rodent ancestor.Genome Res. 2012 Mar;22(3):478-85. doi: 10.1101/gr.121897.111. Epub 2011 Nov 29. Genome Res. 2012. PMID: 22128134 Free PMC article.
-
phastSim: efficient simulation of sequence evolution for pandemic-scale datasets.bioRxiv [Preprint]. 2021 Sep 23:2021.03.15.435416. doi: 10.1101/2021.03.15.435416. bioRxiv. 2021. Update in: PLoS Comput Biol. 2022 Apr 29;18(4):e1010056. doi: 10.1371/journal.pcbi.1010056 PMID: 33758852 Free PMC article. Updated. Preprint.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
