Molecular and bioinformatical characterization of a novel superfamily of cysteine-rich peptides from arthropods

Peptides. 2013 Mar:41:45-58. doi: 10.1016/j.peptides.2012.10.004. Epub 2012 Oct 22.

Abstract

The full-length cDNA sequences of two novel cysteine-rich peptides (referred to as HsVx1 and MmKTx1) were obtained from scorpions. The two peptides represent a novel class of cysteine-rich peptides with a unique cysteine pattern. The genomic sequence of HsVx1 is composed of three exons interrupted by two introns that are localized in the mature peptide encoding region and inserted in phase 1 and phase 2, respectively. Such a genomic organization markedly differs from those of other peptides from scorpions described previously. Genome-wide search for the orthologs of HsVx1 identified 59 novel cysteine-rich peptides from arthropods. These peptides share a consistent cysteine pattern with HsVx1. Genomic comparison revealed extensive intron length differences and intronic number and position polymorphisms among the genes of these peptides. Further analysis identified 30 cases of intron sliding, 1 case of intron gain and 22 cases of intron loss occurred with the genes of the HsVx1 and HsVx1-like peptides. It is interesting to see that three HsVx1-like peptides XP_001658928, XP_001658929 and XP_001658930 were derived from a single gene (XP gene): the former two were generated from alternative splicing; the third one was encoded by a DNA region in the reverse complementary strand of the third intron of the XP gene. These findings strongly suggest that the genes of these cysteine-rich peptides were evolved by intron sliding, intron gain/loss, gene recombination and alternative splicing events in response to selective forces without changing their cysteine pattern. The evolution of these genes is dominated by intron sliding and intron loss.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions
  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Computational Biology
  • Conserved Sequence
  • Cysteine / chemistry*
  • Evolution, Molecular
  • Insect Proteins / chemistry*
  • Insect Proteins / genetics
  • Introns
  • Molecular Sequence Data
  • Peptides / chemistry
  • Peptides / genetics
  • Phylogeny
  • Polymorphism, Genetic
  • Scorpions / genetics*
  • Sequence Homology, Amino Acid

Substances

  • 5' Untranslated Regions
  • Insect Proteins
  • Peptides
  • Cysteine

Associated data

  • GENBANK/JQ821370
  • GENBANK/JQ821371
  • GENBANK/KC140563