Medical implications of technical accuracy in genome sequencing
- PMID: 26932475
- PMCID: PMC4774017
- DOI: 10.1186/s13073-016-0269-0
Medical implications of technical accuracy in genome sequencing
Abstract
Background: As whole exome sequencing (WES) and whole genome sequencing (WGS) transition from research tools to clinical diagnostic tests, it is increasingly critical for sequencing methods and analysis pipelines to be technically accurate. The Genome in a Bottle Consortium has recently published a set of benchmark SNV, indel, and homozygous reference genotypes for the pilot whole genome NIST Reference Material based on the NA12878 genome.
Methods: We examine the relationship between human genome complexity and genes/variants reported to be associated with human disease. Specifically, we map regions of medical relevance to benchmark regions of high or low confidence. We use benchmark data to assess the sensitivity and positive predictive value of two representative sequencing pipelines for specific classes of variation.
Results: We observe that the accuracy of a variant call depends on the genomic region, variant type, and read depth, and varies by analytical pipeline. We find that most false negative WGS calls result from filtering while most false negative WES variants relate to poor coverage. We find that only 74.6% of the exonic bases in ClinVar and OMIM genes and 82.1% of the exonic bases in ACMG-reportable genes are found in high-confidence regions. Only 990 genes in the genome are found entirely within high-confidence regions while 593 of 3,300 ClinVar/OMIM genes have less than 50% of their total exonic base pairs in high-confidence regions. We find greater than 77 % of the pathogenic or likely pathogenic SNVs currently in ClinVar fall within high-confidence regions. We identify sites that are prone to sequencing errors, including thousands present in publicly available variant databases. Finally, we examine the clinical impact of mandatory reporting of secondary findings, highlighting a false positive variant found in BRCA2.
Conclusions: Together, these data illustrate the importance of appropriate use and continued improvement of technical benchmarks to ensure accurate and judicious interpretation of next-generation DNA sequencing results in the clinical setting.
Figures
Comment in
-
Genetic testing: Clinical sequencing right on target.Nat Rev Genet. 2016 May;17(5):253. doi: 10.1038/nrg.2016.34. Epub 2016 Mar 21. Nat Rev Genet. 2016. PMID: 26996078 No abstract available.
Similar articles
-
Interplay between probe design and test performance: overlap between genomic regions of interest, capture regions and high quality reference calls influence performance of WES-based assays.Hum Genet. 2021 Feb;140(2):289-297. doi: 10.1007/s00439-020-02201-y. Epub 2020 Jul 5. Hum Genet. 2021. PMID: 32627054
-
Archived neonatal dried blood spot samples can be used for accurate whole genome and exome-targeted next-generation sequencing.Mol Genet Metab. 2013 Sep-Oct;110(1-2):65-72. doi: 10.1016/j.ymgme.2013.06.004. Epub 2013 Jun 13. Mol Genet Metab. 2013. PMID: 23830478
-
From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing.Hum Mutat. 2016 Dec;37(12):1263-1271. doi: 10.1002/humu.23114. Epub 2016 Sep 26. Hum Mutat. 2016. PMID: 27604516 Free PMC article.
-
Clinical sequencing: From raw data to diagnosis with lifetime value.Clin Genet. 2018 Mar;93(3):508-519. doi: 10.1111/cge.13190. Clin Genet. 2018. PMID: 29206278 Review.
-
Use of whole exome and genome sequencing in the identification of genetic causes of primary immunodeficiencies.Curr Opin Allergy Clin Immunol. 2012 Dec;12(6):623-8. doi: 10.1097/ACI.0b013e3283588ca6. Curr Opin Allergy Clin Immunol. 2012. PMID: 23095910 Review.
Cited by
-
A novel synthetic nucleic acid mixture for quantification of microbes by mNGS.Microb Genom. 2024 Feb;10(2):001199. doi: 10.1099/mgen.0.001199. Microb Genom. 2024. PMID: 38358316 Free PMC article.
-
The diagnostic odyssey of a patient with dihydropyrimidinase deficiency: a case report and review of the literature.Cold Spring Harb Mol Case Stud. 2024 Jan 10;9(4):a006319. doi: 10.1101/mcs.a006319. Print 2023 Dec. Cold Spring Harb Mol Case Stud. 2024. PMID: 38199782 Free PMC article. Review.
-
A precision overview of genomic resistance screening in Ecuadorian isolates of Mycobacterium tuberculosis using web-based bioinformatics tools.PLoS One. 2023 Dec 5;18(12):e0294670. doi: 10.1371/journal.pone.0294670. eCollection 2023. PLoS One. 2023. PMID: 38051742 Free PMC article.
-
Quartet DNA reference materials and datasets for comprehensively evaluating germline variant calling performance.Genome Biol. 2023 Nov 27;24(1):270. doi: 10.1186/s13059-023-03109-2. Genome Biol. 2023. PMID: 38012772 Free PMC article.
-
The Future of Newborn Genomic Testing.Children (Basel). 2023 Jun 30;10(7):1140. doi: 10.3390/children10071140. Children (Basel). 2023. PMID: 37508635 Free PMC article. Review.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous
