Extensive gene amplification and concerted evolution within the CPR family of cuticular proteins in mosquitoes

Insect Biochem Mol Biol. 2008 Jun;38(6):661-76. doi: 10.1016/j.ibmb.2008.04.001. Epub 2008 May 19.

Abstract

Annotation of the Anopheles gambiae genome has revealed a large increase in the number of genes encoding cuticular proteins with the Rebers and Riddiford Consensus (the CPR gene family) relative to Drosophila melanogaster. This increase reflects an expansion of the RR-2 group of CPR genes, particularly the amplification of sets of highly similar paralogs. Patterns of nucleotide variation indicate that extensive concerted evolution is occurring within these clusters. The pattern of concerted evolution is complex, however, as sequence similarity within clusters is uncorrelated with gene order and orientation, and no comparable clusters occur within similarly compact arrays of the RR-1 group in mosquitoes or in either group in D. melanogaster. The dearth of pseudogenes suggests that sequence clusters are maintained by selection for high gene-copy number, perhaps due to selection for high expression rates. This hypothesis is consistent with the apparently parallel evolution of compact gene architectures within sequence clusters relative to single-copy genes. We show that RR-2 proteins from sequence-cluster genes have complex repeats and extreme amino-acid compositions relative to single-copy CPR proteins in An. gambiae, and that the amino-acid composition of the N-terminal and C-terminal sequence flanking the chitin-binding consensus region evolves in a correlated fashion.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Anopheles / genetics*
  • Culex / genetics
  • Evolution, Molecular*
  • Gene Amplification*
  • Genome, Insect
  • Insect Proteins / chemistry
  • Insect Proteins / genetics*
  • Molecular Sequence Data
  • Multigene Family
  • Phylogeny

Substances

  • Insect Proteins