dbCAN: a web resource for automated carbohydrate-active enzyme annotation

Nucleic Acids Res. 2012 Jul;40(Web Server issue):W445-51. doi: 10.1093/nar/gks479. Epub 2012 May 29.


Carbohydrate-active enzymes (CAZymes) are very important to the biotech industry, particularly the emerging biofuel industry because CAZymes are responsible for the synthesis, degradation and modification of all the carbohydrates on Earth. We have developed a web resource, dbCAN (http://csbl.bmb.uga.edu/dbCAN/annotate.php), to provide a capability for automated CAZyme signature domain-based annotation for any given protein data set (e.g. proteins from a newly sequenced genome) submitted to our server. To accomplish this, we have explicitly defined a signature domain for every CAZyme family, derived based on the CDD (conserved domain database) search and literature curation. We have also constructed a hidden Markov model to represent the signature domain of each CAZyme family. These CAZyme family-specific HMMs are our key contribution and the foundation for the automated CAZyme annotation.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Carbohydrate Metabolism*
  • Enzyme Activation
  • Enzymes / chemistry*
  • Enzymes / classification
  • Enzymes / metabolism
  • Internet
  • Metagenome
  • Molecular Sequence Annotation*
  • Protein Structure, Tertiary
  • Sequence Alignment
  • Software*


  • Enzymes