Use of ordered deletions in genome sequencing

Gene. 1997 Sep 15;197(1-2):367-73. doi: 10.1016/s0378-1119(97)00285-0.

Abstract

Previous attempts to use the non-random approach for sequencing long DNA fragments have met with little success. As a result, nearly all genomic sequencing is done by the random (shotgun) approach, and the economy promised by the non-random approach has so far not materialized. Here we describe a simple system based on the use of ordered deletions that can be incorporated in the common strategies for genome sequencing. Long genomic fragments are cloned in the pAL-F cosmid and fragmented by digestion with specific restriction endonucleases. The digests are religated to subclone individual restriction fragments. The subclones are then subdivided by overlapping deletions and used for sequencing. We present the nucleotide sequences of two cosmid inserts from chromosome IV of Drosophila (containing the ci gene and the 5' end of the zfh-2 gene) that were determined by this method. This is the first report of successful sequencing of long genomic fragments by the use of overlapping deletions. Our calculations show that, with the present approach, sequence data can be acquired at a rate comparable to the shotgun approach but with significantly reduced numbers (approximately 30%) of sequencing runs. Hence, the use of ordered deletions should allow significant savings in both the amount and cost of sequencing work.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cosmids / genetics
  • DNA Transposable Elements / genetics
  • DNA-Binding Proteins / genetics
  • Drosophila Proteins*
  • Drosophila melanogaster / genetics*
  • Genes, Insect / genetics
  • Genetic Vectors / genetics
  • Genome
  • Molecular Sequence Data
  • Restriction Mapping
  • Sequence Analysis, DNA / methods*
  • Sequence Deletion*
  • Transcription Factors

Substances

  • DNA Transposable Elements
  • DNA-Binding Proteins
  • Drosophila Proteins
  • Transcription Factors
  • ci protein, Drosophila
  • Zfh2 protein, Drosophila

Associated data

  • GENBANK/U66884
  • GENBANK/U66885
  • GENBANK/U87107
  • GENBANK/U87286