The effects of alignment error and alignment filtering on the sitewise detection of positive selection
- PMID: 22049066
- DOI: 10.1093/molbev/msr272
The effects of alignment error and alignment filtering on the sitewise detection of positive selection
Abstract
When detecting positive selection in proteins, the prevalence of errors resulting from misalignment and the ability of alignment filters to mitigate such errors are not well understood, but filters are commonly applied to try to avoid false positive results. Focusing on the sitewise detection of positive selection across a wide range of divergence levels and indel rates, we performed simulation experiments to quantify the false positives and false negatives introduced by alignment error and the ability of alignment filters to improve performance. We found that some aligners led to many false positives, whereas others resulted in very few. False negatives were a problem for all aligners, increasing with sequence divergence. Of the aligners tested, PRANK's codon-based alignments consistently performed the best and ClustalW performed the worst. Of the filters tested, GUIDANCE performed the best and Gblocks performed the worst. Although some filters showed good ability to reduce the error rates from ClustalW and MAFFT alignments, none were found to substantially improve the performance of PRANK alignments under most conditions. Our results revealed distinct trends in error rates and power levels for aligners and filters within a biologically plausible parameter space. With the best aligner, a low false positive rate was maintained even with extremely divergent indel-prone sequences. Controls using the true alignment and an optimal filtering method suggested that performance improvements could be gained by improving aligners or filters to reduce the prevalence of false negatives, especially at higher divergence levels and indel rates.
Similar articles
-
The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection.Mol Biol Evol. 2010 Oct;27(10):2257-67. doi: 10.1093/molbev/msq115. Epub 2010 May 5. Mol Biol Evol. 2010. PMID: 20447933
-
Improving the performance of positive selection inference by filtering unreliable alignment regions.Mol Biol Evol. 2012 Jan;29(1):1-5. doi: 10.1093/molbev/msr177. Epub 2011 Jul 19. Mol Biol Evol. 2012. PMID: 21772063
-
Characterization of multiple sequence alignment errors using complete-likelihood score and position-shift map.BMC Bioinformatics. 2016 Mar 18;17:133. doi: 10.1186/s12859-016-0945-5. BMC Bioinformatics. 2016. PMID: 26992851 Free PMC article.
-
Alignment methods: strategies, challenges, benchmarking, and comparative overview.Methods Mol Biol. 2012;855:203-35. doi: 10.1007/978-1-61779-582-4_7. Methods Mol Biol. 2012. PMID: 22407710 Review.
-
Naturally selecting solutions: the use of genetic algorithms in bioinformatics.Bioengineered. 2013 Sep-Oct;4(5):266-78. doi: 10.4161/bioe.23041. Epub 2012 Dec 6. Bioengineered. 2013. PMID: 23222169 Free PMC article. Review.
Cited by
-
Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.BMC Genomics. 2015 Aug 7;16(1):585. doi: 10.1186/s12864-015-1724-9. BMC Genomics. 2015. PMID: 26250829 Free PMC article.
-
Parallel selection on ecologically relevant gene functions in the transcriptomes of highly diversifying salmonids.BMC Genomics. 2019 Dec 23;20(1):1010. doi: 10.1186/s12864-019-6361-2. BMC Genomics. 2019. PMID: 31870285 Free PMC article.
-
Divergence and adaptive evolution of the gibberellin oxidase genes in plants.BMC Evol Biol. 2015 Sep 29;15:207. doi: 10.1186/s12862-015-0490-2. BMC Evol Biol. 2015. PMID: 26416509 Free PMC article.
-
Gene Tree Affects Inference of Sites Under Selection by the Branch-Site Test of Positive Selection.Evol Bioinform Online. 2016 Jan 18;11(Suppl 2):11-7. doi: 10.4137/EBO.S30902. eCollection 2015. Evol Bioinform Online. 2016. PMID: 26819542 Free PMC article.
-
Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications.Mol Biol Evol. 2024 Sep 4;41(9):msae177. doi: 10.1093/molbev/msae177. Mol Biol Evol. 2024. PMID: 39172750 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
