Classification of proteins with shared motifs and internal repeats in the ECOD database

Protein Sci. 2016 Jul;25(7):1188-203. doi: 10.1002/pro.2893. Epub 2016 Feb 21.

Abstract

Proteins and their domains evolve by a set of events that commonly includes the duplication and divergence of small motifs. Short repetitive regions within domains have generally been a difficult case for structural domain classifications and their hierarchies. We developed the Evolutionary Classification Of protein Domains (ECOD) in part to implement a new schema for classifying these types of proteins. Here we document how ECOD classifies proteins with small internal repeats, widespread functional motifs, and assemblies of small domain-like fragments in its evolutionary schema. We also illustrate how the structural genomics project has affected the classification and characterization of new structural domains and sequence families over the past decade.

Keywords: internal repeats; protein classification; protein motifs; structural bioinformatics; structural genomics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Amino Acid Motifs*
  • Databases, Protein
  • Evolution, Molecular
  • Models, Molecular
  • Protein Domains
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteomics / methods*

Substances

  • Proteins