CATH: comprehensive structural and functional annotations for genome sequences

Nucleic Acids Res. 2015 Jan;43(Database issue):D376-81. doi: 10.1093/nar/gku947. Epub 2014 Oct 27.


The latest version of the CATH-Gene3D protein structure classification database (4.0, provides annotations for over 235,000 protein domain structures and includes 25 million domain predictions. This article provides an update on the major developments in the 2 years since the last publication in this journal including: significant improvements to the predictive power of our functional families (FunFams); the release of our 'current' putative domain assignments (CATH-B); a new, strictly non-redundant data set of CATH domains suitable for homology benchmarking experiments (CATH-40) and a number of improvements to the web pages.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Protein*
  • Genomics
  • Internet
  • Molecular Sequence Annotation*
  • Protein Structure, Tertiary* / genetics
  • Proteins / classification


  • Proteins