Spurious regulatory connections dictate the expression-fitness landscape of translation factors
- PMID: 33900014
- PMCID: PMC8073009
- DOI: 10.15252/msb.202110302
Spurious regulatory connections dictate the expression-fitness landscape of translation factors
Abstract
During steady-state cell growth, individual enzymatic fluxes can be directly inferred from growth rate by mass conservation, but the inverse problem remains unsolved. Perturbing the flux and expression of a single enzyme could have pleiotropic effects that may or may not dominate the impact on cell fitness. Here, we quantitatively dissect the molecular and global responses to varied expression of translation termination factors (peptide release factors, RFs) in the bacterium Bacillus subtilis. While endogenous RF expression maximizes proliferation, deviations in expression lead to unexpected distal regulatory responses that dictate fitness reduction. Molecularly, RF depletion causes expression imbalance at specific operons, which activates master regulators and detrimentally overrides the transcriptome. Through these spurious connections, RF abundances are thus entrenched by focal points within the regulatory network, in one case located at a single stop codon. Such regulatory entrenchment suggests that predictive bottom-up models of expression-fitness landscapes will require near-exhaustive characterization of parts.
Keywords: expression-fitness landscape; multiscale measurements; peptide chain release factors; regulatory entrenchment; translation factors.
© 2021 The Authors. Published under the terms of the CC BY 4.0 license.
Conflict of interest statement
The authors declare that they have no conflict of interest.
Figures
Expression‐fitness landscapes, which connect enzyme expression (microscopic variable) to the growth rate (cellular phenotype), can be dictated by direct or indirect effects. Inset (i): direct effects correspond to reduction in the flux cognate to the perturbed enzyme (protein synthesis rate in the case of translation factors). Inset (ii): indirect effects result from pleiotropic propagation across mechanistic, regulatory, and systemic levels.
As a case study, the expression of peptide chain release factors (RFs: RF1, RF2, and associated methyltransferase PrmC), involved in the first step of mRNA translation termination, was tuned around endogenous levels.
Strains with inducible copies of RFs, and deleted endogenous genes, were used to systematically vary RF expression. The resulting impacts on the cell internal state (RNA‐seq, ribosome profiling) and relative growth rate s (competition experiments) were measured, leading to precise mapping of expression‐fitness landscapes.
- A
Schematic tunable expression system. Inducible constructs are added at safe harbor loci, together with a barcode for competition experiments. The endogenous gene copy is then deleted in a scarless fashion.
- B, C
Details of loci with tunable expression cassettes for the orthogonally tunable (B) RF2 and PrmC strain, and (C) RF1 and PrmC strain. Disrupted safe harbor endogenous loci are amyE, lacA, and levB. Control strains were also constructed with blank expression cassettes at these locations (Materials and Methods, Fig EV2F, G, I and J). Inducible repressors XylR and LacI, respectively, responsive to IPTG and xylose are shown in black. Resistance cassettes are shown in yellow. The location of the 8‐nt chromosomal barcode in one arm of the amyE homology region is shown in purple.
- D
Fold‐change in RF levels as a function of inducers, with fitted Hill curve, serves as a guide to the eye. Endogenous expression is shown as the horizontal dashed line (fold‐change of 1) and the full attainable dynamic range indicated on the right. See Materials and Methods for calibration to proteome fraction.
- E, F
3D RF expression space for all phenotypically profiled conditions (shown separately in Fig 2A–D) shown in (E), and orthogonal projection in 2D subspaces in (F). Dashed lines mark endogenous expression levels.
- A
Schematic of competition experiments. Pools of barcoded strains competed for ≈ 30 generations with five samplings, barcode frequencies are quantified, and changes in barcode frequencies over time determined. Relative growth rate is λ inducible/λ WT = 1 + s, where s is the slope of the log2 barcode ratio vs. time (number of generations). This process was performed for all induction conditions shown in Fig 2A–D.
- B
Schematic the barcode readout procedure, carried out in two PCR steps from genomic DNA extracted from pools of competing strains, with UMI and first index added at the first PCR, and a second index added at the second PCR. Details of the final amplicon for barcode readout are in Dataset EV6.
- C, D
Examples of barcode frequency ratios over time for isogenic strain pairs from a single competition experiment, (C) wild‐type vs. wild‐type, and (D) RF2 overexpression vs. wild‐type. Representative strain pairs from experiment E1‐C9 are shown. Inferred s: = λ inducible/λ WT − 1 from the linear fit (black line) is shown on the graph. Range of slopes smin–smax of subsampled bootstraps (gray lines) is reported as . Dashed lines correspond to 95% confidence interval. Error bars correspond to estimated noise attributable from Poisson counting noise in UMI counts (, where N 1 and N 2 are the respective UMI barcode counts for the compared strain pairs at the corresponding time point).
- E
Distribution of measured s for pairs of strains with identical genotype apart from barcode across all experiments (n = 1,253 comparisons, e.g., 4/1,253 experimentally determined s are shown in panel C). The shaded gray area corresponds to the ± 2σ s = ± 1.2% shown in Fig 2E–H.
- F, G
Measured fitness difference to wild‐type for strains with blank expression cassettes. (F) Blank Pxyl at amyE & blank PspankHy at lacA, strains GLB434–437, n = 644 comparisons. (G) Blank PspankHy at amyE & blank Pxyl at levB, strains GLB446–449, n = 252 comparisons. Both control strain series show minimal effect of ectopic insertions on cell fitness. s values displayed correspond to median with 25th and 75th percentile of values across isogenic pairs.
- H
Hand mixing experiment with two strains with different barcodes, showing accurate (slopes 1.00 and 0.98 for observed vs. expected ratio of barcodes from two technical replicates) readout of cell frequencies in pool over nearly four orders of magnitude. Median difference between readout from two technical replicates is 20%. Error bars are as in (C and D).
- I, J
Representative examples of mRNA level (RNA‐seq) comparison between wild‐type and control strains with blank expression cassette insertions: (I) GLB434 vs. wild‐type (experiment E1‐C5), and (J) GLB446 vs. wild‐type (experiment E2‐C1). Cumulative distributions of fold‐changes are shown as insets as in Fig 3A and B.
- A–D
Profiled orthogonal directions of the RF expression subspace, respectively, scanning along the dimension of (A) RF1, (B) RF2, (C) PrmC, and (D) PrmC with RF2 overexpression. Axes correspond to expression levels of RF1, RF2, and PrmC.
- E–H
Cell exponential growth rate s measured by competition (relative to wild‐type) at corresponding RF levels (reported in units of proteome fraction, derived from a ribosome profiling calibration, Materials and Methods) shown in (A–D). Endogenous levels of RFs are indicated with dashed vertical lines. Gray shadings mark the precision of our fitness measurement, defined as ± 2σ s = ± 1.2%, where σ s is the standard deviation in the measured relative growth rate among isogenic redundantly barcoded strain (see Fig EV2E), with the distribution shown as inset in panel (H). Relative fitness value reported corresponds to the median across isogenic barcoded pairs, and vertical black bars delineate 25th to 75th percentile among such pairs (between 21 and 28 isogenic pairs, see Dataset EV6) for a single experiment and is typically smaller than the plot symbol.
- I–L
Trajectories following RF perturbation in the space of relative growth rate vs. estimated proteome fraction to translation proteins (translation sector).
mRNA levels (reads per million mapped reads per kilobase, rpkm, genes with > 5 reads mapped shown) for maximal PrmC overexpression (green arrow) versus unperturbed cells (average across control datasets, Materials and Methods). σ B regulon members and translation‐related proteins are marked in red × and dark gray +, respectively. Targets for RT–qPCR measurements of σ B induction (ygxB, ywzA, Fig 5) are highlighted in dark red. mRNA levels for RF1 (blue), RF2 (orange), and PrmC (green) are marked by dots (corrected for translation efficiency of ectopic expression constructs, Materials and Methods). σ B regulon activation is marked by dashed red polygon. Inset shows cumulative distribution of fold‐changes in mRNA levels (red σ B regulon, dark gray translation, pale gray rest of proteins). σ B activation and translation compression are highlighted by arrows indicating shift in median expression. As a comparison, distribution of fold‐changes for all genes among unperturbed replicates are show in light blue.
Same as (A), but with a strain harboring a deletion of gene sigB, which abrogates σ B regulon activation, and restores genome‐wide expression levels despite PrmC overexpression (light gray line in inset).
Quantification of the proteome fraction to the σ B regulon (σ B sector) as a function of PrmC (see Materials and Methods for calibration from transcriptome to proteome fraction). Dashed vertical line marks endogenous PrmC level. Inset reproduces broader context of data in RF expression subspace (Fig 2C).
Similar to (C), but quantifying proteome fraction to the translation sector.
Proteome fraction of the translation sector as a function of the excess proteome fraction to the σ B regulon, denoted . Dashed line corresponds to growth laws prediction (Appendix Supplementary Methods, equation 1, using parameters , , , and obtained from fits in Fig 2J and K), full line corresponds to decrease by factor 1 − ϕ U.
Schematic illustration of passive proteome fraction compression under σ B activation (increase in regulon expression).
Relative growth rate as a function of PrmC level, with and without sigB.
Growth difference with and without sigB (corresponding to ∆s panel G), as a function of excess proteome fraction to the σ B regulon. Dashed lines are growth law prediction (Appendix Supplementary Methods, equation 2), full line corresponds to −ϕ U.
- A–I
Analogous to Fig 3, but for varying PrmC levels in conjunction with RF2 overexpression (conditions shown in Fig 2D). In (E–H), open light green pentagrams correspond to cells with sigB, and filled dark green hexagrams to cells without sigB (deletion). Blue shadings mark the region of the expression space for which the growth defect is not rescued by sigB deletion, indicating a different underlying cause for the decrease in translation sector.
- J
Comparison of expression at maximal PrmC expression for endogenous and overexpressed RF2 levels (respective comparisons to unperturbed conditions in Fig 3A and current panel C), showing highly reproducible σ B induction independent of RF2 levels (the two outliers marked by black circles are xylA and xylB, which are responsive to xylose).
- K
Expression‐fitness landscape for RF2 and PrmC.
- L–N
orthogonal projections from K showing the (L) RF2, and (M, N) PrmC directions. Panel (N) is the PrmC fitness landscape, with the fitness defect caused by RF2 overexpression defect subtracted out (arrows in panels L and M). Fitness defect at overexpressed PrmC is independent of RF2 (dashed black line in N). Knockdown defect is exacerbated by RF2 overexpression (black arrows in N).
- O, P
Transcriptome under RF2 expression perturbation. (O) RF2 knockdown shows modest σ B induction, whereas (P) maximal RF2 overexpression displays little expression changes.
- Q
Growth rate difference for strain with inducible RF2, with and without sigB. A mild but significant (P < 10−5, bootstrap subsampling, Materials and Methods) improvement in fitness upon sigB deletion at lowest RF2 levels is seen. Measured s for each of 12 strain pairs, inducible RF2 (GLB426 to GLB429) vs. inducible RF2 without sigB (GLB430 to GLB433), are shown for all profiled RF2 levels, with the median and 25th to 75th percentile marked by black lines (error bars). Red marks the condition with lowest RF2 level. Inset shows cumulative distribution of relative fitness difference for lowest RF2 level (red) and rest of conditions (black). The fitness rescue upon sigB deletion (median increase in fitness ∆s = 0.009) is commensurate with the estimated excess σ B regulon proteome fraction ( = 0.0085) in this condition.
- A
Profiled RF1 levels in the RF expression subspace (reproduction of Fig 2A).
- B
Fitness defect upon modulation of RF1 level, with (open pale blue squares) and without sigB (filled dark blue circles), showing no strong influence of σ B.
- C
Quantification of various regulons' transcriptome fraction as a function of RF1 levels (shown for strain with sigB). The large fitness decrease upon RF1 knockdown coincides with decrease in motility (SigD, cyan) regulon and autolysin operon (magenta), and increase in biofilm matrix eps (light green) genes production.
- D, E
mRNA levels (rpkm, genes with > 5 reads mapped shown) comparison to unperturbed for maximal RF1 knockdown, with regulon members colored following (c), for strains (D) with or (E) without sigB. Median fold‐change is larger or equal to 5 for SigD, eps, and lyt genes, independently from σ B. Cumulative distributions of fold‐change for highlighted regulons (rest of genes in gray, all‐to‐all for across unperturbed replicates in pale blue) are shown as insets as in Fig 3A.
- A–C
Heatmaps of all data used to generate metagene ribosome queuing plot shown in Fig 4A for (A) wild‐type, (B) RF1/PrmC CRISPRi depletion, and (C) RF2 CRISPRi depletion. Genes are separated by stop codon. Each horizontal line represents a gene, and gene‐normalized ribosome footprint density (center‐mapped) is shown as gray scale, horizontally aligned by the position of the stop codon (5′–3′ left to right). Genes are organized in increasing order of translation efficiency moving up (TE). Queues upstream of stop codons with perturbed RFs can be seen (colored boxes in B and C) and are longer for genes with high TE. Scale bars indicate 25 nt and 100 genes. Caret ▲ marks the position of stop codons.
- D
Control analysis of changes in expression stoichiometry under RF depletion for co‐directional genes within 30 bp, but stratified by the stop codon of the downstream gene (RF1/PrmC: FCUAG = 0.99, P = 0.39; RF2: FCUGA = 0.98, P = 0.27, P‐value from stop codon reshufflings, Materials and Methods).
- E
Similar to (D), but for co‐directional genes separated by more than 30 bp, stratifying by the upstream gene (RF1/PrmC: FCUAG = 1.01, P = 0.60; RF2: FCUGA = 1.04, P = 0.96, P‐value from stop codon reshufflings, Materials and Methods).
- F
Analysis for expression stoichiometry of gene pairs paralleling Fig 4D, but with Rend‐seq data, showing no effect (RF1/PrmC: FCUAG = 1.00, P = 0.59; RF2: FCUGA = 1.02, P = 0.98, P‐value from stop codon reshufflings, Materials and Methods). This further suggests that perturbed expression stoichiometry results from changes in translation.
- G
Distributions of fold‐change in mRNA levels between wild‐type and RF depletion stratified by stop codon, showing small (≈ 2%) changes in median for RF‐perturbed stop (RF1/PrmC: FCUAG = 0.98, P = 0.06; RF2: FCUGA = 0.98, P = 0.02, P‐value from stop codon reshufflings, Materials and Methods).
- H
Schematic illustrating how decreasing termination rate leads to ribosome queues on mRNAs with high translation efficiency (ribosome initiation rate), see Appendix Supplementary Methods.
- I
Fold‐change in expression stoichiometry for gene pairs considered in Fig 4E (within 30 bp and stratified by upstream stop codon, subset with measured TE shown) as a function of TE. Significant (F‐test from MATLAB's regress function, P < 0.05) correlations (increasing translation efficiency leading to more severe effect, R 2 ≈ 0.1) are seen for genes with stop cognate to the RF perturbation.
- J
Cumulative number of co‐directional gene pairs separated by given distance in Bacillus subtilis, stratified by stop codon identity. Overlap
A UG A (arrow) is the most common configuration. - K
Same as Fig 4D, but with UGA pairs split between those with
A UG A overlap or not, with the overall effect distribution is similar between the two types of overlaps.
- A
Metagene trace (Materials and Methods) of gene‐normalized ribosome footprint read density (center‐mapped, genes with footprint density > 0.5/nt) upstream of stop codons stratified by stop codon. Wild‐type (top), RF1/PrmC depletion (middle), and RF2 depletion (bottom) are shown. Ribosome queues upstream of stop codon cognate to RF perturbations can be seen.
- B
Translation readthrough score for isolated genes (Materials and Methods) under acute R1/PrmC and RF2 CRISPRi knockdown. Points in beeswarm plot correspond to individual genes, overlaid box plot highlighting the interquartile range (25th to 75th percentile, median red mark). Fivefold increase in translational readthrough is seen for genes terminating with the stop codon cognate to the knocked‐down RF (arrows).
- C, D
Comparison of expression (ribosome footprint density, rpkm) for (C) RF1/PrmC depletion, and (D) RF2 depletion, vs. wild‐type. Regulons and RF are marked as in Fig 3A. RF2 depletion leads to σ B activation, in contrast to RF1/PrmC depletion.
- E
Fold‐change in the downstream‐to‐upstream expression (ribosome profiling) stoichiometry upon RF depletion for co‐directional genes within 30 bp, and stratified by the stop codon of the upstream gene. A mild but systematic and stop codon‐specific decrease in downstream gene expression is seen (RF1/PrmC: median UAG fold‐change compared with UAA (FCUAG) = 0.88, P < 10−6; RF2: FCUGA = 0.82, P < 10−6, P‐values from stop codon reshufflings, Materials and Methods). We ascribe this effect to obstruction of downstream ribosome initiation by idle upstream terminating ribosomes. Some protein pairs, such as RsbW/RsbV, are affected more strongly.
- F
Fold‐change in expression between RF depletion and wild‐type stratified by stop codon of genes. No systematic effect for the different stop codons is seen (RF1/PrmC: FCUAG = 1.01, P = 0.69; RF2: FCUGA = 1.02, P = 0.86, P‐values from stop codon reshufflings, Materials and Methods), indicating lack of strong change in expression for genes with compromised translation termination.
Schematic of simplified σ B regulatory architecture and operon structure.
Different rsbV stop codon variants considered. Top row shows endogenous configuration rsbV UGA, with RF2‐dependent stop codon and
A UG A open reading frames overlap. Middle row shows the UAA RF‐agnostic variant rsbV UAA, obtained by adding AAT (underlined). Bottom row shows the RF1‐dependent UAG allele rsbV UAG, obtained by inserting AGAT (underlined).Average of reporter gene log2 fold‐change compared with wild‐type for σ B reporter genes (ywzA and ygxB, highlighted in Fig 3A) as quantified by RT–qPCR (2–3 independent biological replicates for each allele/condition, ± indicates s.e.m. over replicates for the two reporter genes, raw data in Dataset EV8, Materials and Methods) showing strong induction of reporter genes in the RF perturbations cognate to the stop codon of the rsbV variant. Rows correspond to rsbV stop variant in panel (B) and columns to RF expression conditions. RF‐inducible measurements were done in strains with orthogonally tunable RF1/PrmC (IPTG) and RF2 (xylose; Materials and Methods).
Similar articles
-
Release factor-dependent ribosome rescue by BrfA in the Gram-positive bacterium Bacillus subtilis.Nat Commun. 2019 Nov 27;10(1):5397. doi: 10.1038/s41467-019-13408-7. Nat Commun. 2019. PMID: 31776341 Free PMC article.
-
Translation elicits a growth rate-dependent, genome-wide, differential protein production in Bacillus subtilis.Mol Syst Biol. 2016 May 17;12(5):870. doi: 10.15252/msb.20156608. Mol Syst Biol. 2016. PMID: 27193784 Free PMC article.
-
Genome-wide identification of genes directly regulated by the pleiotropic transcription factor Spx in Bacillus subtilis.Nucleic Acids Res. 2012 Oct;40(19):9571-83. doi: 10.1093/nar/gks755. Epub 2012 Aug 16. Nucleic Acids Res. 2012. PMID: 22904090 Free PMC article.
-
A proteomic view of cell physiology of Bacillus subtilis--bringing the genome sequence to life.Adv Biochem Eng Biotechnol. 2003;83:57-92. doi: 10.1007/3-540-36459-5_3. Adv Biochem Eng Biotechnol. 2003. PMID: 12934926 Review.
-
Termination of protein synthesis.Mol Biol Rep. 1994 May;19(3):171-81. doi: 10.1007/BF00986959. Mol Biol Rep. 1994. PMID: 7969105 Review.
Cited by
-
Protein overabundance is driven by growth robustness.bioRxiv [Preprint]. 2024 Aug 17:2024.08.14.607847. doi: 10.1101/2024.08.14.607847. bioRxiv. 2024. PMID: 39185236 Free PMC article. Preprint.
-
First-principles model of optimal translation factors stoichiometry.Elife. 2021 Sep 30;10:e69222. doi: 10.7554/eLife.69222. Elife. 2021. PMID: 34590582 Free PMC article.
-
The evolution and functional significance of the programmed ribosomal frameshift in prfB.bioRxiv [Preprint]. 2024 Sep 25:2024.09.24.614795. doi: 10.1101/2024.09.24.614795. bioRxiv. 2024. PMID: 39386688 Free PMC article. Preprint.
-
Non-linear transcriptional responses to gradual modulation of transcription factor dosage.bioRxiv [Preprint]. 2024 Aug 6:2024.03.01.582837. doi: 10.1101/2024.03.01.582837. bioRxiv. 2024. PMID: 38464330 Free PMC article. Preprint.
-
Sigma factor dependent translational activation in Bacillus subtilis.RNA. 2021 Apr 29;27(7):791-804. doi: 10.1261/rna.078747.121. Online ahead of print. RNA. 2021. PMID: 33927010 Free PMC article.
References
Publication types
MeSH terms
Substances
Associated data
- Actions
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
