Ultralow-input single-tube linked-read library method enables short-read second-generation sequencing systems to routinely generate highly accurate and economical long-range sequencing information

Genome Res. 2020 Jun;30(6):898-909. doi: 10.1101/gr.260380.119. Epub 2020 Jun 15.

Abstract

Long-range sequencing information is required for haplotype phasing, de novo assembly, and structural variation detection. Current long-read sequencing technologies can provide valuable long-range information but at a high cost with low accuracy and high DNA input requirements. We have developed a single-tube Transposase Enzyme Linked Long-read Sequencing (TELL-seq) technology, which enables a low-cost, high-accuracy, and high-throughput short-read second-generation sequencer to generate over 100 kb of long-range sequencing information with as little as 0.1 ng input material. In a PCR tube, millions of clonally barcoded beads are used to uniquely barcode long DNA molecules in an open bulk reaction without dilution and compartmentation. The barcoded linked-reads are used to successfully assemble genomes ranging from microbes to human. These linked-reads also generate megabase-long phased blocks and provide a cost-effective tool for detecting structural variants in a genome, which are important to identify compound heterozygosity in recessive Mendelian diseases and discover genetic drivers and diagnostic biomarkers in cancers.

MeSH terms

  • Computational Biology / methods
  • DNA Barcoding, Taxonomic / methods
  • Gene Library*
  • Genetic Variation
  • Genome, Human
  • Genomics / methods
  • HLA Antigens / genetics
  • Haplotypes
  • High-Throughput Nucleotide Sequencing* / methods
  • High-Throughput Nucleotide Sequencing* / standards
  • Humans
  • Sequence Analysis, DNA* / methods
  • Sequence Analysis, DNA* / standards
  • Workflow

Substances

  • HLA Antigens