A program toolkit for the analysis of regulatory regions of genes

Methods Mol Biol. 2006:338:135-52. doi: 10.1385/1-59745-097-9:135.

Abstract

A major challenge in systems biology is to discover and reconstruct the cis-regulatory networks through which the expression of genes is controlled. Even though a variety of sequences have been shown to interact with the transcription factors that bind DNA, extensive work is needed to discover and classify regulatory "codes" and to elucidate the role played by the sequence context of genomic DNA in the regulation of genes. Databases of sequence elements extracted from regulatory regions may facilitate this process. This report provides a Toolkit and instructions for creating a database for collecting and analyzing 9-base elements (9-mers) from a large collection of DNA sequences. A reference set consisting of all possible 9-mers is included for extracting potential control elements, irrespective of their orientation and order in DNA.
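
As an illustration of the kind of collection the abstract describes, the following is a minimal sketch of tallying 9-mers from a set of DNA sequences. It is not the Toolkit's code. All function names here are hypothetical, and treating a 9-mer and its reverse complement as the same element is an assumed reading of "irrespective of their orientation".

```python
from collections import Counter
from itertools import product

K = 9  # element length used in the abstract (9-mers)
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def canonical(kmer):
    """Assumption: a k-mer and its reverse complement count as one element.

    The lexicographically smaller of the two strings is used as the key.
    """
    return min(kmer, reverse_complement(kmer))


def count_kmers(sequences, k=K):
    """Slide a window of width k over each sequence and tally canonical k-mers.

    Windows containing anything other than A, C, G, or T (for example N)
    are skipped.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if set(kmer) <= set("ACGT"):
                counts[canonical(kmer)] += 1
    return counts


def reference_set(k=K):
    """Lazily generate every possible k-mer over the alphabet A, C, G, T."""
    return ("".join(p) for p in product("ACGT", repeat=k))


if __name__ == "__main__":
    # Hypothetical input; real regulatory regions would be read from a file.
    example = ["ACGTACGTACGGATCCA", "TGGATCCGTACGTACGT"]
    for kmer, n in count_kmers(example).most_common(3):
        print(kmer, n)
```

With k = 9 the reference set holds 4^9 = 262,144 sequences, so the generator form avoids building the whole list unless it is needed. The resulting counts could then be loaded into a database table keyed on the 9-mer.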

MeSH terms

  • Binding Sites / genetics
  • DNA / genetics
  • DNA / metabolism
  • Databases, Nucleic Acid*
  • Genes, Regulator*
  • Genome, Human
  • Genomics / statistics & numerical data
  • Humans
  • Software*
  • Systems Biology
  • Transcription Factors / metabolism

Substances

  • Transcription Factors
  • DNA