Biases in the SMART-DNA library preparation method associated with genomic poly dA/dT sequences

PLoS One. 2017 Feb 24;12(2):e0172769. doi: 10.1371/journal.pone.0172769. eCollection 2017.

Abstract

Avoiding biases in next-generation sequencing (NGS) library preparation is crucial for obtaining reliable sequencing data. Recently, a new library preparation method was introduced that eliminates the need for a ligation step. This method, termed SMART (switching mechanism at the 5' end of the RNA transcript), is based on template-switching reverse transcription. To date, there has been no systematic analysis of the additional biases introduced by this method. We analysed the genomic distribution of sequenced reads prepared from genomic DNA using the SMART methodology and found a strong bias toward genomic loci containing long (≥12 bp) poly dA/dT tracts. This bias is unique to SMART-based library preparation and does not appear when libraries are prepared with conventional ligation-based methods. Although the bias is obvious only with paired-end sequencing, it affects single-end sequenced samples as well. Our analysis demonstrates that sequenced reads originating from SMART-DNA libraries are heavily skewed toward genomic poly dA/dT tracts. This bias needs to be considered when deciding whether to use SMART-based technology for library preparation.
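As an illustration of the kind of check the abstract describes, the following minimal sketch locates long poly dA/dT tracts in a sequence and measures what fraction of read start positions fall near them. It is not the authors' pipeline: the function names, the 50 bp default window, and the use of read start coordinates are assumptions made for illustration; only the ≥12 bp threshold comes from the abstract.

```python
import re

# From the abstract: a "long" tract is at least 12 consecutive A or T bases.
MIN_TRACT_LEN = 12

# Matches a run of A's or a run of T's of at least MIN_TRACT_LEN bases.
TRACT_RE = re.compile(
    r"A{%d,}|T{%d,}" % (MIN_TRACT_LEN, MIN_TRACT_LEN), re.IGNORECASE
)


def find_poly_da_dt_tracts(sequence):
    """Return (start, end, base) per tract; 0-based, end exclusive."""
    return [
        (m.start(), m.end(), m.group()[0].upper())
        for m in TRACT_RE.finditer(sequence)
    ]


def fraction_of_reads_near_tracts(read_starts, tracts, window=50):
    """Fraction of read starts within `window` bp of any tract.

    `window` is a hypothetical parameter chosen for illustration.
    """
    if not read_starts:
        return 0.0
    near = 0
    for pos in read_starts:
        if any(start - window <= pos < end + window for start, end, _ in tracts):
            near += 1
    return near / len(read_starts)


if __name__ == "__main__":
    # Toy sequence: one 12 bp dA tract, one 11 bp dT run (too short),
    # and one 15 bp dT tract.
    seq = "ACGT" + "A" * 12 + "CG" + "T" * 11 + "G" + "T" * 15
    tracts = find_poly_da_dt_tracts(seq)
    print(tracts)
    print(fraction_of_reads_near_tracts([0, 10, 100, 40], tracts, window=5))
```

Comparing this fraction between a SMART-based library and a ligation-based library over the same genome would be one simple way to look for the enrichment reported above, under the assumptions stated.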

MeSH terms

  • Gene Library*
  • Genome
  • Genomics*
  • High-Throughput Nucleotide Sequencing / methods*
  • Poly dA-dT / genetics*
  • RNA / genetics
  • Sequence Analysis, DNA / methods

Substances

  • Poly dA-dT
  • RNA