Phylogenetic analysis of DNA length mutations in a repetitive region of the Hawaiian Drosophila yolk protein gene Yp2

J Mol Evol. 1996 Aug;43(2):116-24. doi: 10.1007/BF02337356.


Nucleotide sequence analysis has demonstrated that interspecific size variation in the YP2 yolk protein among Hawaiian Drosophila is due to in-frame insertions and deletions in two repetitive segments of the coding region of the Yp2 gene. Sequence comparisons of the complex repetitive region close to the 5' end of this gene across 34 endemic Hawaiian taxa revealed five length morphs that together span a length difference of 21 nucleotides (nt). A phylogenetic character reconstruction of the length mutations on an independently derived molecular phylogeny showed clade-specific length variants arising from six ancient events: two identical insertions of 6 nt and four deletions (one of 6 nt, one of 12 nt, and two identical but independent deletions of 15 nt). These mutations can be attributed to replication slippage, with nontandem trinucleotide repeats playing a major role in the slipped-strand mispairing. Geographic analysis suggests that the 15-nt deletion that distinguishes the planitibia subgroup from the cyrtoloma subgroup occurred on Oahu about 3 million years ago. The homoplasies observed caution against relying too heavily on nucleotide insertions/deletions for phylogenetic inference. In contrast to the extensive repeat polymorphisms within other Drosophila and in humans, the more complex 5' Yp2 repetitive region analyzed here appears to lack polymorphism among Hawaiian Drosophila, perhaps due to founder effects, small population sizes, and hitchhiking effects of selection on the immediately adjacent 5' region.
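
The event counts and sizes in the abstract can be checked with a little arithmetic. The sketch below is illustrative only and is not the authors' method: it tallies the six reported events, confirms each is a multiple of 3 nt (that is, in-frame), and shows that the sizes are consistent with five length morphs spanning 21 nt. It rests on a labeled assumption that the abstract does not state: that each length morph differs from a single ancestral length by at most one of the reported events. The names `EVENTS`, `signed_change`, and `morph_offsets` are hypothetical.

```python
# Length-changing events as reported in the abstract:
# (kind, size in nt, number of independent occurrences)
EVENTS = [
    ("insertion", 6, 2),
    ("deletion", 6, 1),
    ("deletion", 12, 1),
    ("deletion", 15, 2),
]


def signed_change(kind, size_nt):
    """Return the length change in nt: positive for insertions, negative for deletions."""
    return size_nt if kind == "insertion" else -size_nt


# Total number of independent events (the abstract reports six).
total_events = sum(count for _, _, count in EVENTS)

# An indel is in-frame when its size is a multiple of 3 nt (one codon).
all_in_frame = all(size % 3 == 0 for _, size, _ in EVENTS)

# Codons gained (+) or lost (-) for each distinct event size.
codon_changes = {
    signed_change(kind, size): signed_change(kind, size) // 3
    for kind, size, _ in EVENTS
}

# ASSUMPTION (not stated in the abstract): every morph is the ancestral
# length (offset 0) modified by at most one of the reported events.
morph_offsets = sorted({0} | set(codon_changes))
length_span = max(morph_offsets) - min(morph_offsets)

print("events:", total_events)
print("all in-frame:", all_in_frame)
print("codon changes by offset:", codon_changes)
print("morph offsets (nt):", morph_offsets)
print("span (nt):", length_span)
```

Under that assumption, the distinct offsets are the ancestral length plus one insertion size and three deletion sizes, which matches the five morphs and the 21 nt span given in the abstract. If some lineages carry more than one event, the actual morph set could differ.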

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • DNA / genetics*
  • DNA Primers
  • DNA Transposable Elements
  • Drosophila / genetics*
  • Drosophila Proteins*
  • Egg Proteins / genetics*
  • Genetic Variation
  • Hawaii
  • Humans
  • Molecular Sequence Data
  • Mutation*
  • Phylogeny*
  • Polymerase Chain Reaction
  • Polymorphism, Genetic*
  • Repetitive Sequences, Nucleic Acid*
  • Sequence Deletion
  • Sequence Homology, Nucleic Acid
  • Vitellogenins*

Substances

  • DNA Primers
  • DNA Transposable Elements
  • Drosophila Proteins
  • Egg Proteins
  • Vitellogenins
  • yolk protein 2, Drosophila
  • DNA

Associated data

  • GENBANK/U61697
  • GENBANK/U61698
  • GENBANK/U61699
  • GENBANK/U61700
  • GENBANK/U61701
  • GENBANK/U61702
  • GENBANK/U61703
  • GENBANK/U61704
  • GENBANK/U61705
  • GENBANK/U61706
  • GENBANK/U61707
  • GENBANK/U61708
  • GENBANK/U61709
  • GENBANK/U61710
  • GENBANK/U61711
  • GENBANK/U61712
  • GENBANK/U61713
  • GENBANK/U61714
  • GENBANK/U61715
  • GENBANK/U61716
  • GENBANK/U61717
  • GENBANK/U61718
  • GENBANK/U61719
  • GENBANK/U61720
  • GENBANK/U61721
  • GENBANK/U61722