Metazoan evolution of the armadillo repeat superfamily

Cell Mol Life Sci. 2017 Feb;74(3):525-541. doi: 10.1007/s00018-016-2319-6. Epub 2016 Aug 6.

Abstract

The superfamily of armadillo repeat proteins is a fascinating archetype of modular-binding proteins involved in various fundamental cellular processes, including cell-cell adhesion, cytoskeletal organization, nuclear import, and molecular signaling. Despite their diverse functions, they all share tandem armadillo (ARM) repeats, which stack together to form a conserved three-dimensional structure. This superhelical armadillo structure enables them to interact with distinct partners by wrapping around them. Despite the important functional roles of this superfamily, a comprehensive analysis of the composition, classification, and phylogeny of this protein superfamily has not been reported. Furthermore, relatively little is known about a subset of ARM proteins, and some of the current annotations of armadillo repeats are incomplete or incorrect, often due to high similarity with HEAT repeats. We identified the entire armadillo repeat superfamily repertoire in the human genome, annotated each armadillo repeat, and performed an extensive evolutionary analysis of the armadillo repeat proteins in both metazoan and premetazoan species. Phylogenetic analyses of the superfamily classified them into several discrete branches with members showing significant sequence homology, and often also related functions. Interestingly, the phylogenetic structure of the superfamily revealed that about 30 % of the members predate metazoans and represent an ancient subset, which is gradually evolving to acquire complex and highly diverse functions.

Keywords: ARM repeat; Catenins; HEAT repeat; Importins; Metazoa; Molecular evolution; Phylogeny; Premetazoan; Protein domains.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Armadillo Domain Proteins / chemistry
  • Armadillo Domain Proteins / classification
  • Armadillo Domain Proteins / genetics*
  • Armadillo Domain Proteins / metabolism
  • Biological Evolution
  • Evolution, Molecular
  • Humans
  • Models, Molecular
  • Phylogeny*
  • Protein Conformation
  • Sequence Alignment

Substances

  • Armadillo Domain Proteins