Sequence analysis and editing for bisulphite genomic sequencing projects

Nucleic Acids Res. 2007;35(10):e79. doi: 10.1093/nar/gkm330. Epub 2007 May 21.

Abstract

Bisulphite genomic sequencing is a widely used technique for detailed analysis of the methylation status of a region of DNA. It relies upon the selective deamination of unmethylated cytosine to uracil after treatment with sodium bisulphite, usually followed by PCR amplification of the chosen target region. Since this two-step procedure replaces all unmethylated cytosine bases with thymine, PCR products derived from unmethylated templates contain only three types of nucleotide, in unequal proportions. This can create a number of technical difficulties (e.g. for some base-calling methods) and impedes manual analysis of sequencing results (since the long runs of T or A residues are difficult to align visually with the parent sequence). To facilitate the detailed analysis of bisulphite PCR products (particularly using multiple cloned templates), we have developed a visually intuitive program that identifies the methylation status of CpG dinucleotides by analysis of raw sequence data files produced by MegaBace or ABI sequencers as well as Staden SCF trace files and plain text files. The program then also collates and presents data derived from independent templates (e.g. separate clones). This results in a considerable reduction in the time required for completion of a detailed genomic methylation project.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Graphics
  • CpG Islands*
  • DNA Methylation*
  • Genomics / methods*
  • Polymerase Chain Reaction
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Software*
  • Sulfites / chemistry*

Substances

  • Sulfites
  • hydrogen sulfite