On the effective depth of viral sequence data
- PMID: 29250429
- PMCID: PMC5724399
- DOI: 10.1093/ve/vex030
On the effective depth of viral sequence data
Abstract
Genome sequence data are of great value in describing evolutionary processes in viral populations. However, in such studies, the extent to which data accurately describes the viral population is a matter of importance. Multiple factors may influence the accuracy of a dataset, including the quantity and nature of the sample collected, and the subsequent steps in viral processing. To investigate this phenomenon, we sequenced replica datasets spanning a range of viruses, and in which the point at which samples were split was different in each case, from a dataset in which independent samples were collected from a single patient to another in which all processing steps up to sequencing were applied to a single sample before splitting the sample and sequencing each replicate. We conclude that neither a high read depth nor a high template number in a sample guarantee the precision of a dataset. Measures of consistency calculated from within a single biological sample may also be insufficient; distortion of the composition of a population by the experimental procedure or genuine within-host diversity between samples may each affect the results. Where it is possible, data from replicate samples should be collected to validate the consistency of short-read sequence data.
Keywords: evolutionary modelling; population genetics; sequence data.
Figures
Similar articles
-
Measurements of intrahost viral diversity require an unbiased diversity metric.Virus Evol. 2019 Jan 30;5(1):vey041. doi: 10.1093/ve/vey041. eCollection 2019 Jan. Virus Evol. 2019. PMID: 30723551 Free PMC article.
-
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015. Front Bioeng Biotechnol. 2015. PMID: 26442255 Free PMC article.
-
Benchmarking viromics: an in silico evaluation of metagenome-enabled estimates of viral community composition and diversity.PeerJ. 2017 Sep 21;5:e3817. doi: 10.7717/peerj.3817. eCollection 2017. PeerJ. 2017. PMID: 28948103 Free PMC article.
-
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification.In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. PMID: 26269925 Free Books & Documents. Review.
-
Exploring the hepatitis C virus genome using single molecule real-time sequencing.World J Gastroenterol. 2019 Aug 28;25(32):4661-4672. doi: 10.3748/wjg.v25.i32.4661. World J Gastroenterol. 2019. PMID: 31528092 Free PMC article. Review.
Cited by
-
A2B-COVID: A Tool for Rapidly Evaluating Potential SARS-CoV-2 Transmission Events.Mol Biol Evol. 2022 Mar 2;39(3):msac025. doi: 10.1093/molbev/msac025. Mol Biol Evol. 2022. PMID: 35106603 Free PMC article.
-
Measurements of intrahost viral diversity require an unbiased diversity metric.Virus Evol. 2019 Jan 30;5(1):vey041. doi: 10.1093/ve/vey041. eCollection 2019 Jan. Virus Evol. 2019. PMID: 30723551 Free PMC article.
-
Genomic analyses of Symbiomonas scintillans show no evidence for endosymbiotic bacteria but does reveal the presence of giant viruses.PLoS Genet. 2024 Apr 1;20(4):e1011218. doi: 10.1371/journal.pgen.1011218. eCollection 2024 Apr. PLoS Genet. 2024. PMID: 38557755 Free PMC article.
-
A novel framework for inferring parameters of transmission from viral sequence data.PLoS Genet. 2018 Oct 16;14(10):e1007718. doi: 10.1371/journal.pgen.1007718. eCollection 2018 Oct. PLoS Genet. 2018. PMID: 30325921 Free PMC article.
-
A large effective population size for established within-host influenza virus infection.Elife. 2020 Aug 10;9:e56915. doi: 10.7554/eLife.56915. Elife. 2020. PMID: 32773034 Free PMC article.
References
-
- Ait-Khaled M. et al. (1995) ‘Distinct HIV-1 long terminal repeat quasispecies present in nervous tissues compared to that in lung, blood and lymphoid tissues of an AIDS patient’, AIDS, 9/7: 675–683. - PubMed
-
- Beerenwinkel N., Zagordi O. (2011) ‘Ultra-deep sequencing for the analysis of viral populations’, Current Opinion Virology, 1/5: 413–418. - PubMed
LinkOut - more resources
Full Text Sources
Other Literature Sources
