Sources of PCR-induced distortions in high-throughput sequencing data sets
- PMID: 26187991
- PMCID: PMC4666380
- DOI: 10.1093/nar/gkv717
Sources of PCR-induced distortions in high-throughput sequencing data sets
Abstract
PCR permits the exponential and sequence-specific amplification of DNA, even from minute starting quantities. PCR is a fundamental step in preparing DNA samples for high-throughput sequencing. However, there are errors associated with PCR-mediated amplification. Here we examine the effects of four important sources of error-bias, stochasticity, template switches and polymerase errors-on sequence representation in low-input next-generation sequencing libraries. We designed a pool of diverse PCR amplicons with a defined structure, and then used Illumina sequencing to search for signatures of each process. We further developed quantitative models for each process, and compared predictions of these models to our experimental data. We find that PCR stochasticity is the major force skewing sequence representation after amplification of a pool of unique DNA amplicons. Polymerase errors become very common in later cycles of PCR but have little impact on the overall sequence distribution as they are confined to small copy numbers. PCR template switches are rare and confined to low copy numbers. Our results provide a theoretical basis for removing distortions from high-throughput sequencing data. In addition, our findings on PCR stochasticity will have particular relevance to quantification of results from single cell sequencing, in which sequences are represented by only one or a few molecules.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
Similar articles
-
Enzymological description of multitemplate PCR-Shrinking amplification bias by optimizing the polymerase-template ratio.J Theor Biol. 2015 Oct 7;382:178-86. doi: 10.1016/j.jtbi.2015.06.048. Epub 2015 Jul 9. J Theor Biol. 2015. PMID: 26164060
-
Primer ID Validates Template Sampling Depth and Greatly Reduces the Error Rate of Next-Generation Sequencing of HIV-1 Genomic RNA Populations.J Virol. 2015 Aug;89(16):8540-55. doi: 10.1128/JVI.00522-15. Epub 2015 Jun 3. J Virol. 2015. PMID: 26041299 Free PMC article.
-
Benefits and Challenges with Applying Unique Molecular Identifiers in Next Generation Sequencing to Detect Low Frequency Mutations.PLoS One. 2016 Jan 11;11(1):e0146638. doi: 10.1371/journal.pone.0146638. eCollection 2016. PLoS One. 2016. PMID: 26752634 Free PMC article.
-
[Quantitative PCR in the diagnosis of Leishmania].Parassitologia. 2004 Jun;46(1-2):163-7. Parassitologia. 2004. PMID: 15305709 Review. Italian.
-
[Polymerase chain reaction, cold probes and clinical diagnosis].Sante. 1994 Jan-Feb;4(1):43-52. Sante. 1994. PMID: 7909267 Review. French.
Cited by
-
Genomic Mutations in SARS-CoV-2 Genome following Infection in Syrian Golden Hamster and Associated Lung Pathologies.Pathogens. 2023 Nov 8;12(11):1328. doi: 10.3390/pathogens12111328. Pathogens. 2023. PMID: 38003792 Free PMC article.
-
Production of structurally diverse sphingolipids by anaerobic marine bacteria in the euxinic Black Sea water column.ISME J. 2024 Jan 8;18(1):wrae153. doi: 10.1093/ismejo/wrae153. ISME J. 2024. PMID: 39113610 Free PMC article.
-
CRISPR-Cas13: A new technology for the rapid detection of pathogenic microorganisms.Front Microbiol. 2022 Oct 28;13:1011399. doi: 10.3389/fmicb.2022.1011399. eCollection 2022. Front Microbiol. 2022. PMID: 36386639 Free PMC article. Review.
-
CRISPR/cas systems redefine nucleic acid detection: Principles and methods.Biosens Bioelectron. 2020 Oct 1;165:112430. doi: 10.1016/j.bios.2020.112430. Epub 2020 Jul 8. Biosens Bioelectron. 2020. PMID: 32729545 Free PMC article. Review.
-
Ultra-efficient sequencing of T Cell receptor repertoires reveals shared responses in muscle from patients with Myositis.EBioMedicine. 2020 Sep;59:102972. doi: 10.1016/j.ebiom.2020.102972. Epub 2020 Sep 3. EBioMedicine. 2020. PMID: 32891935 Free PMC article.
References
-
- Dabney J., Meyer M. Length and GC-biases during sequencing library amplification: a comparison of various polymerase-buffer systems with ancient and modern DNA sequencing libraries. Biotechniques. 2012;52:87–94. - PubMed
-
- Jagers P., Klebaner F. Random variation and concentration effects in PCR. J. Theor. Biol. 2003;224:304–299. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
