The evolution of filamin-a protein domain repeat perspective

J Struct Biol. 2012 Sep;179(3):289-98. doi: 10.1016/j.jsb.2012.02.010. Epub 2012 Mar 10.

Abstract

Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Cluster Analysis
  • Consensus Sequence
  • Contractile Proteins / chemistry*
  • Contractile Proteins / genetics*
  • Evolution, Molecular
  • Filamins
  • Humans
  • Markov Chains
  • Microfilament Proteins / chemistry*
  • Microfilament Proteins / genetics*
  • Models, Genetic
  • Models, Molecular
  • Molecular Sequence Data
  • Phylogeny
  • Protein Binding
  • Protein Interaction Domains and Motifs
  • Protein Structure, Secondary
  • Repetitive Sequences, Amino Acid
  • Sequence Analysis, Protein

Substances

  • Contractile Proteins
  • Filamins
  • Microfilament Proteins