Distribution and function of new bacterial intein-like protein domains

Mol Microbiol. 2003 Jan;47(1):61-73. doi: 10.1046/j.1365-2958.2003.03283.x.

Abstract

Hint protein domains appear in inteins and in the C-terminal region of Hedgehog and Hedgehog-like animal developmental proteins. Intein Hint domains are responsible and sufficient for protein-splicing of their host-protein flanks. In Hedgehog proteins the Hint domain autocatalyses its cleavage from the N-terminal domain of the Hedgehog protein by attaching a cholesterol molecule to it. We identified two new types of Hint domains. Both types have active site sequence features of Hint domains but also possess distinguishing sequence features. The new domains appear in more than 50 different proteins from diverse bacteria, including pathogenic species of humans and plants, such as Neisseria meningitidis and Pseudomonas syringae. These new domains are termed bacterial intein-like (BIL) domains. Bacterial intein-like domains are present in variable protein regions and are typically flanked by domains that also appear in secreted proteins such as filamentous haemagglutinin and calcium binding RTX repeats. Phylogenetic and genomic analysis of BIL sequences suggests that they were positively selected for in different lineages. We cloned two BIL domains of different types and showed them to be active. One of the domains efficiently cleaved itself from its C-terminal flank and could also protein-splice its two flanks, in E. coli and in a cell free system. We discuss several possible biological roles for BIL domains including microevolution and post translational modification for generating protein variability.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry*
  • Bacterial Proteins / metabolism
  • Gene Expression
  • Molecular Sequence Data
  • Protein Processing, Post-Translational*
  • Protein Splicing / physiology*
  • Protein Structure, Tertiary / physiology*
  • Pseudomonas / metabolism

Substances

  • Bacterial Proteins