A phylogenetic analysis of the lipocalin protein family

Mol Biol Evol. 2000 Jan;17(1):114-26. doi: 10.1093/oxfordjournals.molbev.a026224.

Abstract

The lipocalins are a family of extracellular proteins that bind and transport small hydrophobic molecules. They are found in eubacteria and a great variety of eukaryotic cells, in which they play diverse physiological roles. We report here the detection of two new eukaryotic lipocalins and a phylogenetic analysis of 113 lipocalin family members performed with maximum-likelihood and parsimony methods on their amino acid sequences. Lipocalins segregate into 13 monophyletic clades, some of which are grouped in well-supported superclades. An examination of the G + C content of the bacterial lipocalin genes and the detection of four new conceptual lipocalins in other eubacterial species argue against a recent horizontal transfer as the origin of prokaryotic lipocalins. Therefore, we rooted our lipocalin tree using the clade containing the prokaryotic lipocalins. The topology of the rooted lipocalin tree is in general agreement with the currently accepted view of the organismal phylogeny of arthropods and chordates. The rooted tree allows us to assign polarity to character changes and suggests a plausible scenario for the evolution of important lipocalin properties. More recently evolved lipocalins tend to (1) show greater rates of amino acid substitutions, (2) have more flexible protein structures, (3) bind smaller hydrophobic ligands, and (4) increase the efficiency of their ligand-binding contacts. Finally, we found that the family of fatty-acid-binding proteins originated from the more derived lipocalins and therefore cannot be considered a sister group of the lipocalin family.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Carrier Proteins / genetics*
  • Evolution, Molecular*
  • Humans
  • Molecular Sequence Data
  • Phylogeny*
  • Sequence Alignment
  • Sequence Analysis

Substances

  • Carrier Proteins