Massive over-representation of solute-binding proteins (SBPs) from the tripartite tricarboxylate transporter (TTT) family in the genome of the α-proteobacterium Rhodoplanes sp. Z2-YC6860

Microb Genom. 2018 May;4(5):e000176. doi: 10.1099/mgen.0.000176. Epub 2018 Apr 18.

Abstract

Lineage-specific expansion (LSE) of protein families is a widespread phenomenon in many eukaryotic genomes, but is generally more limited in bacterial genomes. Here, we report the presence of 434 genes encoding solute-binding proteins (SBPs) from the tripartite tricarboxylate transporter (TTT) family, within the 8.2 Mb genome of the α-proteobacterium Rhodoplanes sp. Z2-YC6860, a gene family over-representation of unprecedented abundance in prokaryotes. Representing over 6 % of the total number of coding sequences, the SBP genes are distributed across the whole genome but are found rarely in low-GC islands, where the gene density for this family is much lower. This observation, and the much higher sequence identity between the 434 Rhodoplanes TTT SBPs compared with the average identity between homologues from different species, is indicative of a key role for LSE in the expansion. The TTT SBP genes were found in the vicinity of genes encoding membrane components of transport systems from different families, as well as regulatory proteins such as histidine-kinases and transcription factors, indicating a broad range of functions around the sensing, response and transport of organic compounds. A smaller expansion of TTT SBPs is known in some species of the β-proteobacteria Bordetella and we observed similar expansions in other β-proteobacterial lineages, including members of the genus Comamonas and the industrial biotechnology organism Cupriavidus necator, indicating that strong environmental selection can drive SBP duplication and specialisation from multiple evolutionary starting points.

Keywords: Gene duplication; Solute transporter; gene over-representation; lineage specific expansion; periplasmic-binding protein.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alphaproteobacteria / genetics
  • Alphaproteobacteria / metabolism
  • Bacterial Proteins / genetics
  • Bordetella / genetics
  • Carrier Proteins / biosynthesis*
  • Carrier Proteins / genetics*
  • Comamonas / genetics
  • Cupriavidus necator / genetics
  • Genes, Bacterial / genetics*
  • Genome Size
  • Genome, Bacterial
  • Histidine Kinase / genetics
  • Hyphomicrobiaceae / genetics*
  • Hyphomicrobiaceae / metabolism*
  • Periplasmic Binding Proteins / biosynthesis
  • Periplasmic Binding Proteins / genetics
  • Transcription Factors / genetics

Substances

  • Bacterial Proteins
  • Carrier Proteins
  • Periplasmic Binding Proteins
  • Transcription Factors
  • citrate-binding transport protein
  • Histidine Kinase