Diversity and evolution of conotoxins based on gene expression profiling of Conus litteratus

Genomics. 2006 Dec;88(6):809-819. doi: 10.1016/j.ygeno.2006.06.014. Epub 2006 Aug 14.

Abstract

Cone snails are attracting increasing scientific attention due to their unprecedented diversity of invaluable channel-targeted peptides. As arguably the largest and most successful evolutionary genus of invertebrates, Conus also may become the model system to study the evolution of multigene families and biodiversity. Here, a set of 897 expressed sequence tags (ESTs) derived from a Conus litteratus venom duct was analyzed to illuminate the diversity and evolution mechanism of conotoxins. Nearly half of these ESTs represent the coding sequences of conotoxins, which were grouped into 42 novel conotoxin cDNA sequences (seven superfamilies), with T-superfamily conotoxins being the dominant component. The gene expression profile of conotoxin revealed that transcripts are expressed with order-of-magnitude differences, sequence divergence within a superfamily increases from the N to the C terminus of the open reading frame, and even multiple scaffold-different mature peptides exist in a conotoxin gene superfamily. Most excitingly, we identified a novel conotoxin superfamily and three novel cysteine scaffolds. These results give an initial insight into the C. litteratus transcriptome that will contribute to a better understanding of conotoxin evolution and the study of the cone snail genome in the near future.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Conotoxins* / chemistry
  • Conotoxins* / genetics
  • Conotoxins* / metabolism
  • Conus Snail / genetics
  • Conus Snail / metabolism*
  • Evolution, Molecular*
  • Exons
  • Gene Expression Profiling*
  • Gene Library
  • Genetic Variation*
  • Molecular Sequence Data
  • Neurotoxins* / chemistry
  • Neurotoxins* / genetics
  • Neurotoxins* / metabolism
  • Proteins / genetics
  • Proteins / metabolism
  • Sequence Analysis, DNA

Substances

  • Conotoxins
  • Neurotoxins
  • Proteins

Associated data

  • GENBANK/ABA39796
  • GENBANK/DQ205654
  • GENBANK/DQ345351
  • GENBANK/DQ345352
  • GENBANK/DQ345353
  • GENBANK/DQ345354
  • GENBANK/DQ345355
  • GENBANK/DQ345356
  • GENBANK/DQ345357
  • GENBANK/DQ345358
  • GENBANK/DQ345359
  • GENBANK/DQ345360
  • GENBANK/DQ345361
  • GENBANK/DQ345362
  • GENBANK/DQ345363
  • GENBANK/DQ345364
  • GENBANK/DQ345365
  • GENBANK/DQ345366
  • GENBANK/DQ345367
  • GENBANK/DQ345368
  • GENBANK/DQ345369
  • GENBANK/DQ345370
  • GENBANK/DQ345371
  • GENBANK/DQ345372
  • GENBANK/DQ345373
  • GENBANK/DQ345374
  • GENBANK/DQ345375
  • GENBANK/DQ345376
  • GENBANK/DQ345377
  • GENBANK/DQ345378
  • GENBANK/DQ345379
  • GENBANK/DQ345380
  • GENBANK/DQ345381
  • GENBANK/DQ345382
  • GENBANK/DQ345383
  • GENBANK/DQ345384
  • GENBANK/DQ345385
  • GENBANK/DQ345386
  • GENBANK/DQ345387
  • GENBANK/DQ345388
  • GENBANK/DQ345389
  • GENBANK/DQ345390
  • GENBANK/DQ345391
  • GENBANK/DQ359921
  • GENBANK/DQ359922