How and how much does RAD-seq bias genetic diversity estimates?
- PMID: 27825303
- PMCID: PMC5100275
- DOI: 10.1186/s12862-016-0791-0
How and how much does RAD-seq bias genetic diversity estimates?
Abstract
Background: RAD-seq is a powerful tool, increasingly used in population genomics. However, earlier studies have raised red flags regarding possible biases associated with this technique. In particular, polymorphism on restriction sites results in preferential sampling of closely related haplotypes, so that RAD data tends to underestimate genetic diversity.
Results: Here we (1) clarify the theoretical basis of this bias, highlighting the potential confounding effects of population structure and selection, (2) confront predictions to real data from in silico digestion of full genomes and (3) provide a proof of concept toward an ABC-based correction of the RAD-seq bias. Under a neutral and panmictic model, we confirm the previously established relationship between the true polymorphism and its RAD-based estimation, showing a more pronounced bias when polymorphism is high. Using more elaborate models, we show that selection, resulting in heterogeneous levels of polymorphism along the genome, exacerbates the bias and leads to a more pronounced underestimation. On the contrary, spatial genetic structure tends to reduce the bias. We confront the neutral and panmictic model to "ideal" empirical data (in silico RAD-sequencing) using full genomes from natural populations of the fruit fly Drosophila melanogaster and the fungus Shizophyllum commune, harbouring respectively moderate and high genetic diversity. In D. melanogaster, predictions fit the model, but the small difference between the true and RAD polymorphism makes this comparison insensitive to deviations from the model. In the highly polymorphic fungus, the model captures a large part of the bias but makes inaccurate predictions. Accordingly, ABC corrections based on this model improve the estimations, albeit with some imprecisions.
Conclusion: The RAD-seq underestimation of genetic diversity associated with polymorphism in restriction sites becomes more pronounced when polymorphism is high. In practice, this means that in many systems where polymorphism does not exceed 2 %, the bias is of minor importance in the face of other sources of uncertainty, such as heterogeneous bases composition or technical artefacts. The neutral panmictic model provides a practical mean to correct the bias through ABC, albeit with some imprecisions. More elaborate ABC methods might integrate additional parameters, such as population structure and selection, but their opposite effects could hinder accurate corrections.
Keywords: ABC; Allele drop-out; Non-neutral model; Population genomics; Population structure; Reduced representation genomics.
Figures
Similar articles
-
Experimental validation of in silico predicted RAD locus frequencies using genomic resources and short read data from a model marine mammal.BMC Genomics. 2019 Jan 22;20(1):72. doi: 10.1186/s12864-019-5440-8. BMC Genomics. 2019. PMID: 30669975 Free PMC article.
-
RADseq underestimates diversity and introduces genealogical biases due to nonrandom haplotype sampling.Mol Ecol. 2013 Jun;22(11):3179-90. doi: 10.1111/mec.12276. Epub 2013 Apr 3. Mol Ecol. 2013. PMID: 23551379
-
Population genomic analysis of model and nonmodel organisms using sequenced RAD tags.Methods Mol Biol. 2012;888:235-60. doi: 10.1007/978-1-61779-870-2_14. Methods Mol Biol. 2012. PMID: 22665285
-
Population genomics of transposable elements in Drosophila.Annu Rev Genet. 2014;48:561-81. doi: 10.1146/annurev-genet-120213-092359. Epub 2014 Oct 1. Annu Rev Genet. 2014. PMID: 25292358 Review.
-
Sequence Capture versus Restriction Site Associated DNA Sequencing for Shallow Systematics.Syst Biol. 2016 Sep;65(5):910-24. doi: 10.1093/sysbio/syw036. Epub 2016 Jun 10. Syst Biol. 2016. PMID: 27288477 Review.
Cited by
-
A need for standardized reporting of introgression: Insights from studies across eukaryotes.Evol Lett. 2022 Jul 25;6(5):344-357. doi: 10.1002/evl3.294. eCollection 2022 Oct. Evol Lett. 2022. PMID: 36254258 Free PMC article.
-
Genome-wide markers reveal a complex evolutionary history involving divergence and introgression in the Abert's squirrel (Sciurus aberti) species group.BMC Evol Biol. 2018 Sep 12;18(1):139. doi: 10.1186/s12862-018-1248-4. BMC Evol Biol. 2018. PMID: 30208839 Free PMC article.
-
Adaptive Genetic Divergence Despite Significant Isolation-by-Distance in Populations of Taiwan Cow-Tail Fir (Keteleeria davidiana var. formosana).Front Plant Sci. 2018 Feb 1;9:92. doi: 10.3389/fpls.2018.00092. eCollection 2018. Front Plant Sci. 2018. PMID: 29449860 Free PMC article.
-
A historical stepping-stone path for an island-colonizing cactus across a submerged "bridge" archipelago.Heredity (Edinb). 2024 Jun;132(6):296-308. doi: 10.1038/s41437-024-00683-4. Epub 2024 Apr 18. Heredity (Edinb). 2024. PMID: 38637723
-
What Darwin could not see: island formation and historical sea levels shape genetic divergence and island biogeography in a coastal marine species.Heredity (Edinb). 2023 Sep;131(3):189-200. doi: 10.1038/s41437-023-00635-4. Epub 2023 Jul 3. Heredity (Edinb). 2023. PMID: 37400518 Free PMC article.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
