Identifying target sites for cooperatively binding factors

Bioinformatics. 2001 Jul;17(7):608-21. doi: 10.1093/bioinformatics/17.7.608.

Abstract

Motivation: Transcriptional activation in eukaryotic organisms normally requires combinatorial interactions of multiple transcription factors. Though several methods exist for identification of individual protein binding site patterns in DNA sequences, there are few methods for discovery of binding site patterns for cooperatively acting factors. Here we present an algorithm, Co-Bind (for COperative BINDing), for discovering DNA target sites for cooperatively acting transcription factors. The method utilizes a Gibbs sampling strategy to model the cooperativity between two transcription factors and defines position weight matrices for the binding sites. Sequences from both the training set and the entire genome are taken into account, in order to discriminate against commonly occurring patterns in the genome, and produce patterns which are significant only in the training set.

Results: We have tested Co-Bind on semi-synthetic and real data sets to show it can efficiently identify DNA target site patterns for cooperatively binding transcription factors. In cases where binding site patterns are weak and cannot be identified by other available methods, Co-Bind, by virtue of modeling the cooperativity between factors, can identify those sites efficiently. Though developed to model protein-DNA interactions, the scope of Co-Bind may be extended to combinatorial, sequence specific, interactions in other macromolecules.

Availability: The program is available upon request from the authors or may be downloaded from http://ural.wustl.edu.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Bacterial Proteins / metabolism
  • Base Sequence
  • Binding Sites / genetics
  • Computational Biology
  • DNA / genetics*
  • DNA / metabolism*
  • DNA, Bacterial / genetics
  • DNA, Bacterial / metabolism
  • DNA-Binding Proteins / metabolism*
  • Databases as Topic
  • Escherichia coli / genetics
  • Escherichia coli / metabolism
  • Genes, Fungal
  • Genome, Bacterial
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / metabolism
  • Software
  • Transcription Factors / metabolism
  • Transcriptional Activation

Substances

  • Bacterial Proteins
  • DNA, Bacterial
  • DNA-Binding Proteins
  • Transcription Factors
  • DNA