Positional distribution of transcription factor binding sites in Arabidopsis thaliana

Sci Rep. 2016 Apr 27;6:25164. doi: 10.1038/srep25164.

Abstract

Binding of a transcription factor (TF) to its DNA binding sites (TFBSs) is a critical step to initiate the transcription of its target genes. It is therefore interesting to know where the TFBSs of a gene are likely to locate in the promoter region. Here we studied the positional distribution of TFBSs in Arabidopsis thaliana, for which many known TFBSs are now available. We developed a method to identify the locations of TFBSs in the promoter sequences of genes in A. thaliana. We found that the distribution is nearly bell-shaped with a peak at 50 base pairs (bp) upstream of the transcription start site (TSS) and 86% of the TFBSs are in the region from -1,000 bp to +200 bp with respect to the TSS. Our distribution was supported by chromatin immunoprecipitation sequencing and microarray data and DNase I hypersensitive site sequencing data. When TF families were considered separately, differences in positional preference were observed between TF families. Our study of the positional distribution of TFBSs seems to be the first in a plant.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Arabidopsis / metabolism
  • Arabidopsis Proteins / genetics
  • Arabidopsis Proteins / metabolism
  • Binding Sites
  • Chromatin Immunoprecipitation
  • Computational Biology / methods*
  • DNA, Plant / chemistry
  • DNA, Plant / metabolism*
  • Oligonucleotide Array Sequence Analysis
  • Promoter Regions, Genetic
  • Sequence Analysis, DNA
  • Transcription Factors / chemistry
  • Transcription Factors / metabolism*
  • Transcription Initiation Site

Substances

  • Arabidopsis Proteins
  • DNA, Plant
  • Transcription Factors