Cloning and sequence analysis of a new cellulase gene encoding CelK, a major cellulosome component of Clostridium thermocellum: evidence for gene duplication and recombination

J Bacteriol. 1999 Sep;181(17):5288-95. doi: 10.1128/JB.181.17.5288-5295.1999.

Abstract

The cellulolytic and hemicellulolytic complex of Clostridium thermocellum, termed cellulosome, consists of up to 26 polypeptides, of which at least 17 have been sequenced. They include 12 cellulases, 3 xylanases, 1 lichenase, and CipA, a scaffolding polypeptide. We report here a new cellulase gene, celK, coding for CelK, a 98-kDa major component of the cellulosome. The gene has an open reading frame (ORF) of 2,685 nucleotides coding for a polypeptide of 895 amino acid residues with a calculated mass of 100,552 Da. A signal peptide of 27 amino acid residues is cut off during secretion, resulting in a mature enzyme of 97,572 Da. The nucleotide sequence is highly similar to that of cbhA (V. V. Zverlov et al., J. Bacteriol. 180:3091-3099, 1998), having an ORF of 3,690 bp coding for the 1,230-amino-acid-residue CbhA of the same bacterium. Homologous regions of the two genes are 86.5 and 84.3% identical without deletion or insertion on the nucleotide and amino acid levels, respectively. Both have domain structures consisting of a signal peptide, a family IV cellulose binding domain (CBD), a family 9 glycosyl hydrolase domain, and a dockerin domain. A striking distinction between the two polypeptides is that there is a 330-amino-acid insertion in CbhA between the catalytic domain and the dockerin domain containing a fibronectin type 3-like domain and family III CBD. This insertion, missing in CelK, is responsible for the size difference between CelK and CbhA. Upstream and downstream flanking sequences of the two genes show no homology. The data indicate that celK and cbhA in the genome of C. thermocellum have evolved through gene duplication and recombination of domain coding sequences. celK without a dockerin domain was expressed in Escherichia coli and purified. The enzyme had pH and temperature optima at 6.0 and 65 degrees C, respectively. It hydrolyzed p-nitrophenyl-beta-D-cellobioside with a Km and a Vmax of 1.67 microM and 15.1 U/mg, respectively. Cellobiose was a strong inhibitor of CelK activity, with a Ki of 0.29 mM. The enzyme was thermostable, after 200 h of incubation at 60 degrees C, 97% of the original activity remained. Properties of the enzyme indicated that it is a cellobiohydrolase.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Cellulase / genetics*
  • Cellulase / metabolism
  • Cellulose 1,4-beta-Cellobiosidase
  • Cloning, Molecular
  • Clostridium / enzymology*
  • Clostridium / genetics
  • DNA, Bacterial
  • Gene Duplication*
  • Genes, Bacterial
  • Genome, Bacterial
  • Molecular Sequence Data
  • Recombination, Genetic*
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid

Substances

  • DNA, Bacterial
  • Cellulase
  • Cellulose 1,4-beta-Cellobiosidase

Associated data

  • GENBANK/AF039030