Measurements of intrahost viral diversity require an unbiased diversity metric
- PMID: 30723551
- PMCID: PMC6354029
- DOI: 10.1093/ve/vey041
Measurements of intrahost viral diversity require an unbiased diversity metric
Abstract
Viruses exist within hosts at large population sizes and are subject to high rates of mutation. As such, viral populations exhibit considerable sequence diversity. A variety of summary statistics have been developed which describe, in a single number, the extent of diversity in a viral population; such measurements allow the diversities of different populations to be compared, and the effect of evolutionary forces on a population to be assessed. Here we highlight statistical artefacts underlying some common measures of sequence diversity, whereby variation in the depth of genome sequencing may substantially affect the extent of diversity measured in a viral population, making comparisons of population diversity invalid. Specifically, naive estimation of sequence entropy provides a systematically biased metric, a lower read depth being expected to produce a lower estimate of diversity. The number of polymorphic loci per kilobase of genome is more unpredictably affected by read depth, giving potentially flawed results at lower sequencing depths. We show that the nucleotide diversity statistic π provides an unbiased estimate of diversity in the sense that the expected value of the statistic is equal to the correct value of the property being measured. Our results are of importance for studies interpreting genome sequence data; we describe how diversity may be assessed in viral populations in a fair and unbiased manner.
Keywords: entropy; polymorphism; sequence data; virus diversity.
Figures
Similar articles
-
Endless Forms: Within-Host Variation in the Structure of the West Nile Virus RNA Genome during Serial Passage in Bird Hosts.mSphere. 2019 Jun 26;4(3):e00291-19. doi: 10.1128/mSphere.00291-19. mSphere. 2019. PMID: 31243074 Free PMC article.
-
Measurements of Intrahost Viral Diversity Are Extremely Sensitive to Systematic Errors in Variant Calling.J Virol. 2016 Jul 11;90(15):6884-95. doi: 10.1128/JVI.00667-16. Print 2016 Aug 1. J Virol. 2016. PMID: 27194763 Free PMC article.
-
Limited Intrahost Diversity and Background Evolution Accompany 40 Years of Canine Parvovirus Host Adaptation and Spread.J Virol. 2019 Dec 12;94(1):e01162-19. doi: 10.1128/JVI.01162-19. Print 2019 Dec 12. J Virol. 2019. PMID: 31619551 Free PMC article.
-
Ultra-deep sequencing for the analysis of viral populations.Curr Opin Virol. 2011 Nov;1(5):413-8. doi: 10.1016/j.coviro.2011.07.008. Epub 2011 Aug 17. Curr Opin Virol. 2011. PMID: 22440844 Review.
-
Understanding the complex evolution of rapidly mutating viruses with deep sequencing: Beyond the analysis of viral diversity.Virus Res. 2017 Jul 15;239:43-54. doi: 10.1016/j.virusres.2016.10.014. Epub 2016 Nov 22. Virus Res. 2017. PMID: 27888126 Review.
Cited by
-
SARS-CoV-2 evolution in animals suggests mechanisms for rapid variant selection.bioRxiv [Preprint]. 2021 Mar 9:2021.03.05.434135. doi: 10.1101/2021.03.05.434135. bioRxiv. 2021. Update in: Proc Natl Acad Sci U S A. 2021 Nov 2;118(44):e2105253118. doi: 10.1073/pnas.2105253118 PMID: 33758844 Free PMC article. Updated. Preprint.
-
Long-read sequencing reveals the evolutionary drivers of intra-host diversity across natural RNA mycovirus infections.Virus Evol. 2021 Dec 1;7(2):veab101. doi: 10.1093/ve/veab101. eCollection 2021 Sep. Virus Evol. 2021. PMID: 35299787 Free PMC article.
-
Standing Genetic Diversity and Transmission Bottleneck Size Drive Adaptation in Bacteriophage Qβ.Int J Mol Sci. 2022 Aug 9;23(16):8876. doi: 10.3390/ijms23168876. Int J Mol Sci. 2022. PMID: 36012143 Free PMC article.
-
Patterns of within-host genetic diversity in SARS-CoV-2.Elife. 2021 Aug 13;10:e66857. doi: 10.7554/eLife.66857. Elife. 2021. PMID: 34387545 Free PMC article.
-
SARS-CoV-2 evolution in animals suggests mechanisms for rapid variant selection.Proc Natl Acad Sci U S A. 2021 Nov 2;118(44):e2105253118. doi: 10.1073/pnas.2105253118. Proc Natl Acad Sci U S A. 2021. PMID: 34716263 Free PMC article.
References
-
- Beerenwinkel N., Zagordi O. (2011) ‘Ultra-Deep Sequencing for the Analysis of Viral Populations’, Current Opinion in Virology, 1: 413. - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous
