Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Apr;40(8):3524-37.
doi: 10.1093/nar/gkr1236. Epub 2011 Dec 17.

Genomic analysis of DNA binding and gene regulation by homologous nucleoid-associated proteins IHF and HU in Escherichia coli K12

Affiliations

Genomic analysis of DNA binding and gene regulation by homologous nucleoid-associated proteins IHF and HU in Escherichia coli K12

Ana I Prieto et al. Nucleic Acids Res. 2012 Apr.

Abstract

IHF and HU are two heterodimeric nucleoid-associated proteins (NAP) that belong to the same protein family but interact differently with the DNA. IHF is a sequence-specific DNA-binding protein that bends the DNA by over 160°. HU is the most conserved NAP, which binds non-specifically to duplex DNA with a particular preference for targeting nicked and bent DNA. Despite their importance, the in vivo interactions of the two proteins to the DNA remain to be described at a high resolution and on a genome-wide scale. Further, the effects of these proteins on gene expression on a global scale remain contentious. Finally, the contrast between the functions of the homo- and heterodimeric forms of proteins deserves the attention of further study. Here we present a genome-scale study of HU- and IHF binding to the Escherichia coli K12 chromosome using ChIP-seq. We also perform microarray analysis of gene expression in single- and double-deletion mutants of each protein to identify their regulons. The sequence-specific binding profile of IHF encompasses ∼30% of all operons, though the expression of <10% of these is affected by its deletion suggesting combinatorial control or a molecular backup. The binding profile for HU is reflective of relatively non-specific binding to the chromosome, however, with a preference for A/T-rich DNA. The HU regulon comprises highly conserved genes including those that are essential and possibly supercoiling sensitive. Finally, by performing ChIP-seq experiments, where possible, of each subunit of IHF and HU in the absence of the other subunit, we define genome-wide maps of DNA binding of the proteins in their hetero- and homodimeric forms.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) The left panels show the distribution of read counts (x-axis truncated at 20) for IHF (blue), HU (green) and the mock-IP (black). The right panels show the correlation (at single-base resolution) in read counts between the mock-IP (x-axis) and HupA (green) or IhfA (blue) ChIP-seq (y-axis). (B) The left panels show the base-level correlation in read counts between IhfA and IhfB (blue), and HupA and HupB (green). The above are all for exponential phase data. The right panels show similar correlations for the same protein, but between the two exponential phase time-points. The signal is the number of reads mapping to a given base position divided by the total number of reads in the sample (multiplied by a factor 107).
Figure 2.
Figure 2.
(A) Comparison of mock-IP-subtracted binding signals (log-scale of the number of reads mapping to a given position normalized by the total number of reads from that sequencing experiment multiplied by a factor 107) for IhfA (blue) and HupA (green). For HupA, the distribution is centered around zero with a slight right tail; this is similar to what might be expected when a simulated replicate of the mock-IP signal is subtracted from the reference mock-IP (black line; where each data point for the in silico replicate is derived from a normal distribution with mean equal to the signal on the mock-IP and standard deviation equal to that across the mock-IP dataset). On the other hand, for IhfA, the distribution has a strong offset from zero, with many points being below zero indicating lack of binding and a considerable number above zero indicating strong signal. (B) Tracks—rendered in Artemis—showing binding signals for IHF (blue) and HU (green) across an ∼70 kb region of the genome. (C) A zoomed-in image of a portion of B, showing an ∼8 kb region of the genome.
Figure 3.
Figure 3.
(A) Comparison of binding signals, represented by Z-scores as described in Kahramanoglou et al. (4), for IhfA, IhfB and Fis. This shows that binding signals for IHF are considerably higher than those for Fis (B) Weblogo representing the binding motifs for IHF.
Figure 4.
Figure 4.
(A) Correlation between HU binding signal (as the median of read counts across the gene body, where read counts are divided by the total number of reads obtained for that sample and multiplied by a factor of 107; this number was then subtracted by the corresponding value from the mock-IP experiment) and A/T content (as a fraction of the total number of bases). These are for exponential phase data. (B) A/T content of regions of enriched mock-IP-subtracted ChIP-seq signal for HupA and HupB [modified from Kind et al. (45)], compared to the A/T content of randomly picked regions of the same length as regions of enriched signal (marked as controls). (C) Best motif identified by MEME for HupA and HupB.
Figure 5.
Figure 5.
Venn diagrams showing the degree of overlap in numbers of genes that are differentially expressed in the various hup deletions at different time-points.
Figure 6.
Figure 6.
Pie charts showing the proportions of genes in various sets (up or downregulated in hupAB, hupA or hupB across any of the four time-points) that are essential, involved in ribosome biogenesis and translation, and motility. Also shown are plots displaying the degree of conservation (as determined by the presence of bi-directional best-hit FASTA orthologs across 380 prokaryotic genomes) and DNA gyrase binding signal [from Jeong et al. (40)]; in these plots ‘X’ is the median of the distribution and ‘+’ shows the first and the third quartiles. Red shows statistical enrichment (pie charts) or statistically higher medians (distributions) when compared to the reference set of all genes (blue).

Similar articles

Cited by

References

    1. Dillon SC, Dorman CJ. Bacterial nucleoid-associated proteins, nucleoid structure and gene expression. Nature reviews. Microbiology. 2010;8:185–195. - PubMed
    1. Azam TA, Ishihama A. Twelve species of the nucleoid-associated protein from Escherichia coli. Sequence recognition specificity and DNA binding affinity. J. Biol. Chem. 1999;274:33105–33113. - PubMed
    1. Ali Azam T, Iwata A, Nishimura A, Ueda S, Ishihama A. Growth phase-dependent variation in protein composition of the Escherichia coli nucleoid. J. Bacteriol. 1999;181:6361–6370. - PMC - PubMed
    1. Kahramanoglou C, Seshasayee ASN, Prieto AI, Ibberson D, Schmidt S, Zimmermann J, Benes V, Fraser GM, Luscombe NM. Direct and indirect effects of H-NS and Fis on global gene expression control in Escherichia coli. Nucleic Acids Res. 2011;39:2073–2091. - PMC - PubMed
    1. Grainger DC, Hurd D, Goldberg MD, Busby SJW. Association of nucleoid proteins with coding and non-coding segments of the Escherichia coli genome. Nucleic Acids Res. 2006;34:4642–4652. - PMC - PubMed

Publication types

MeSH terms