The next generation of target capture technologies - large DNA fragment enrichment and sequencing determines regional genomic variation of high complexity

BMC Genomics. 2016 Jul 9:17:486. doi: 10.1186/s12864-016-2836-6.

Abstract

Background: The ability to capture and sequence large contiguous DNA fragments represents a significant advancement towards the comprehensive characterization of complex genomic regions. While emerging sequencing platforms are capable of producing several kilobases-long reads, the fragment sizes generated by current DNA target enrichment technologies remain a limiting factor, producing DNA fragments generally shorter than 1 kbp. The DNA enrichment methodology described herein, Region-Specific Extraction (RSE), produces DNA segments in excess of 20 kbp in length. Coupling this enrichment method to appropriate sequencing platforms will significantly enhance the ability to generate complete and accurate sequence characterization of any genomic region without the need for reference-based assembly.

Results: RSE is a long-range DNA target capture methodology that relies on the specific hybridization of short (20-25 base) oligonucleotide primers to selected sequence motifs within the DNA target region. These capture primers are then enzymatically extended on the 3'-end, incorporating biotinylated nucleotides into the DNA. Streptavidin-coated beads are subsequently used to pull-down the original, long DNA template molecules via the newly synthesized, biotinylated DNA that is bound to them. We demonstrate the accuracy, simplicity and utility of the RSE method by capturing and sequencing a 4 Mbp stretch of the major histocompatibility complex (MHC). Our results show an average depth of coverage of 164X for the entire MHC. This depth of coverage contributes significantly to a 99.94 % total coverage of the targeted region and to an accuracy that is over 99.99 %.

Conclusions: RSE represents a cost-effective target enrichment method capable of producing sequencing templates in excess of 20 kbp in length. The utility of our method has been proven to generate superior coverage across the MHC as compared to other commercially available methodologies, with the added advantage of producing longer sequencing templates amenable to DNA sequencing on recently developed platforms. Although our demonstration of the method does not utilize these DNA sequencing platforms directly, our results indicate that the capture of long DNA fragments produce superior coverage of the targeted region.

Keywords: DNA sequencing; DNA target capture; Genomic resequencing; MHC haplotype; Targeted enrichment.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Comparative Genomic Hybridization / methods
  • DNA Primers
  • Databases, Genetic
  • Genetic Variation*
  • Genome, Human*
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing* / methods
  • Humans
  • Major Histocompatibility Complex / genetics
  • Nucleic Acid Hybridization
  • Real-Time Polymerase Chain Reaction
  • Sequence Analysis, DNA

Substances

  • DNA Primers