Short inverse complementary amino acid sequences generate protein complexity

C R Biol. 2003 Mar;326(3):339-48. doi: 10.1016/s1631-0691(03)00077-5.

Abstract

Inversions of short genomic sequences play a central role in the generation of protein complexity. More than half of the 1300 motifs registered in ProSite have protein inverse complementary sequences (princoms) among proteins registered in SwissProt. The observed number of princoms occurrences exceeds by far the expected number (p < 10(-10)). Princoms often endow their host proteins with a whole new range of biochemical and physiological capabilities, including the possibility of intramolecular and intermolecular disulfide bond formation. These results support the idea that, like the duplications, the inversions of small genomic fragments have been a fundamental mechanism for shaping genomes.

MeSH terms

  • Amino Acid Motifs
  • Amino Acid Sequence*
  • Animals
  • Apoptosis
  • Base Sequence
  • DNA / chemistry
  • Databases, Protein
  • Disulfides / chemistry
  • Evolution, Molecular
  • Genes
  • Hemoglobins / chemistry
  • Humans
  • Immunoglobulin G / chemistry
  • Mathematics
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / genetics
  • von Willebrand Factor / chemistry

Substances

  • Disulfides
  • Hemoglobins
  • Immunoglobulin G
  • Proteins
  • von Willebrand Factor
  • DNA