Restriction site-associated DNA sequencing, genotyping error estimation and de novo assembly optimization for population genetic inference
- PMID: 24916682
- DOI: 10.1111/1755-0998.12291
Abstract
Restriction site-associated DNA sequencing (RADseq) provides researchers with the ability to record genetic polymorphism across thousands of loci for nonmodel organisms, potentially revolutionizing the field of molecular ecology. However, as with other genotyping methods, RADseq is prone to a number of sources of error that may have consequential effects for population genetic inferences, and these have received only limited attention in terms of the estimation and reporting of genotyping error rates. Here we use individual sample replicates, under the expectation of identical genotypes, to quantify genotyping error in the absence of a reference genome. We then use sample replicates to (i) optimize de novo assembly parameters within the program Stacks, by minimizing error and maximizing the retrieval of informative loci; and (ii) quantify error rates for loci, alleles and single-nucleotide polymorphisms. As an empirical example, we use a double-digest RAD data set of a nonmodel plant species, Berberis alpina, collected from high-altitude mountains in Mexico.
Keywords: RAD sequencing; Stacks; de novo assembly; error rate; optimization; replicates.
© 2014 John Wiley & Sons Ltd.
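The replicate-based approach described in the abstract can be illustrated with a minimal sketch. The abstract does not give formulas, so the definitions below are assumptions chosen for illustration: locus error is the share of loci recovered in only one replicate, allele error is the share of shared loci whose genotypes differ, and SNP error is the share of compared SNP sites whose genotypes differ. The data layout (a dict mapping locus ID to a pair of allele strings, or None for a missing locus) and the function name are hypothetical and do not correspond to any Stacks output format.

```python
from typing import Dict, Optional, Tuple

# Hypothetical layout: each locus maps to its two alleles, written as strings
# of SNP bases, or to None when the locus was not recovered in that replicate.
Genotype = Optional[Tuple[str, str]]


def replicate_error_rates(
    rep_a: Dict[str, Genotype], rep_b: Dict[str, Genotype]
) -> Dict[str, float]:
    """Compare two replicates of one sample, which should be identical.

    Assumed definitions (not taken from the paper):
      locus  = loci present in exactly one replicate / loci present in either
      allele = shared loci with differing genotypes / shared loci
      snp    = SNP sites with differing genotypes / SNP sites compared
    """
    present_a = {k for k, g in rep_a.items() if g is not None}
    present_b = {k for k, g in rep_b.items() if g is not None}
    either = present_a | present_b
    both = present_a & present_b

    allele_mismatches = 0
    snp_mismatches = 0
    snp_sites = 0
    for locus in both:
        geno_a, geno_b = rep_a[locus], rep_b[locus]
        # Allele order within a genotype is arbitrary, so sort before comparing.
        if sorted(geno_a) != sorted(geno_b):
            allele_mismatches += 1
        # zip(*genotype) yields one tuple of bases per SNP site; alleles are
        # assumed to have equal length within a locus.
        for site_a, site_b in zip(zip(*geno_a), zip(*geno_b)):
            snp_sites += 1
            if sorted(site_a) != sorted(site_b):
                snp_mismatches += 1

    nan = float("nan")
    return {
        "locus": len(either - both) / len(either) if either else nan,
        "allele": allele_mismatches / len(both) if both else nan,
        "snp": snp_mismatches / snp_sites if snp_sites else nan,
    }


# Toy replicate pair (invented values, for illustration only).
rep_a = {"loc1": ("AC", "AC"), "loc2": ("G", "T"), "loc3": None, "loc4": ("TT", "TA")}
rep_b = {"loc1": ("AC", "AC"), "loc2": ("G", "G"), "loc3": ("C", "C"), "loc4": ("TA", "TT")}
print(replicate_error_rates(rep_a, rep_b))
```

Under these assumptions, the same function could be run on the replicate pairs from each candidate set of assembly parameters, and the resulting rates compared alongside the number of loci retained.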
