In this chapter, we describe Long Fragment Read (LFR) technology, a DNA preprocessing method for genome-wide haplotyping by whole-genome sequencing (WGS). Adding LFR prior to WGS on any high-throughput DNA sequencer (e.g., Complete Genomics Revolocity™, BGISEQ-500, or Illumina HiSeq) enables the assignment of single-nucleotide polymorphisms (SNPs) and other genomic variants to contigs representing contiguous DNA from a single parent (haplotypes), with N50 lengths of up to ~1 Mb. Importantly, this is achieved independently of any parental sequencing data or knowledge of parental haplotypes. Further, the nature of this method allows for the correction of most amplification, sequencing, and mapping errors, resulting in false-positive error rates as low as 10⁻⁹. The method can be performed either manually, using hand-held micropipettors, or in the preferred, automated manner described below, using liquid-handling robots capable of pipetting in the nanoliter range. Automating the method limits hands-on time and allows a significant reduction in reaction volumes. Finally, the cost of LFR, as described in this chapter, is moderate, and it adds invaluable whole-genome haplotype data to almost any WGS process.
Keywords: Haplotype; Long fragment; Molecular barcode; Next-generation sequencing (NGS); Phasing; Whole-genome amplification (WGA).