Evolution of Antifreeze Glycoprotein Gene From a Trypsinogen Gene in Antarctic Notothenioid Fish

Proc Natl Acad Sci U S A. 1997 Apr 15;94(8):3811-6. doi: 10.1073/pnas.94.8.3811.

Abstract

Freezing avoidance conferred by different types of antifreeze proteins in various polar and subpolar fishes represents a remarkable example of cold adaptation, but how these unique proteins arose is unknown. We have found that the antifreeze glycoproteins (AFGPs) of the predominant Antarctic fish taxon, the notothenioids, evolved from a pancreatic trypsinogen. We have determined the likely evolutionary process by which this occurred through characterization and analyses of notothenioid AFGP and trypsinogen genes. The primordial AFGP gene apparently arose through recruitment of the 5' and 3' ends of an ancestral trypsinogen gene, which provided the secretory signal and the 3' untranslated region, respectively, plus de novo amplification of a 9-nt Thr-Ala-Ala coding element from the trypsinogen progenitor to create a new protein coding region for the repetitive tripeptide backbone of the antifreeze protein. The small sequence divergence (4-7%) between notothenioid AFGP and trypsinogen genes indicates that the transformation of the proteinase gene into the novel ice-binding protein gene occurred quite recently, about 5-14 million years ago (mya), which is highly consistent with the estimated times of the freezing of the Antarctic Ocean at 10-14 mya, and of the main phyletic divergence of the AFGP-bearing notothenioid families at 7-15 mya. The notothenioid trypsinogen to AFGP conversion is the first clear example of how an old protein gene spawned a new gene for an entirely new protein with a new function. It also represents a rare instance in which protein evolution, organismal adaptation, and environmental conditions can be linked directly.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Antifreeze Proteins
  • Base Sequence
  • Evolution, Molecular*
  • Fishes / genetics*
  • Gene Amplification
  • Glycoproteins / genetics*
  • Molecular Sequence Data
  • Repetitive Sequences, Nucleic Acid
  • Trypsinogen / genetics*

Substances

  • Antifreeze Proteins
  • Glycoproteins
  • Trypsinogen

Associated data

  • GENBANK/U58835
  • GENBANK/U58867
  • GENBANK/U58868
  • GENBANK/U58944
  • GENBANK/U58945