SNP discovery in Litopenaeus vannamei with a new computational pipeline

Anim Genet. 2009 Feb;40(1):106-9. doi: 10.1111/j.1365-2052.2008.01792.x. Epub 2008 Sep 26.

Abstract

Litopenaeus vannamei (Pacific white shrimp) have been farmed in the Americas for many years and are growing in popularity in Asia with the development of specific pathogen-free stocks. The full genomic sequence of this species might not be available in the near future, so other tools are needed to locate polymorphic sites for quantitative trait loci mapping, association studies and subsequent marker-assisted selection. Currently, 25 937 L. vannamei expressed sequence tags (ESTs) are publicly available. These sequences were manually screened, masked for tandem repeats and input into CAP3 for clustering. The resulting 3532 contigs were analysed for possible single nucleotide polymorphisms (SNPs) with SNPIDENTIFIER, a newly developed computer program for predicting SNPs. SNPIDENTIFIER is designed for ESTs without accompanying chromatogram sequence quality information, and therefore it performs quality control checks on all data. SNPIDENTIFIER uses only sequences in which the frequency of poor-quality nucleotides (N) is <0.1, and it trims off the first 10 bases of every sequence to ensure higher sequence quality. For a base to be predicted as an SNP, the minor nucleotide (allele) frequency must be >0.1, the minor allele must be observed at least four times and the 15 bases on either side must exactly match the consensus sequence. Using these conservative parameters, 504 SNPs were predicted from 141 contigs for L. vannamei. A small sample of 18 individuals from three lines has been sequenced to verify the predictions, and 17 of the 39 tested SNPs (44%) have been confirmed.
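
The filtering rules described above can be illustrated with a minimal Python sketch. It is not the SNPIDENTIFIER source code, and every function and constant name in it is hypothetical. It assumes that the reads of one contig are already aligned as equal-length strings padded with spaces outside each read's coverage, that the consensus is a simple per-column majority vote, and that the flank rule is applied to each read individually (a read is counted at a site only if its 15 bases on either side match the consensus). The abstract does not specify these details.

```python
from collections import Counter

# Thresholds taken from the abstract; all names are hypothetical.
MAX_N_FREQ = 0.1       # reads must have an N frequency below this
TRIM = 10              # bases trimmed from the start of every read
MIN_MINOR_FREQ = 0.1   # minor allele frequency must exceed this
MIN_MINOR_COUNT = 4    # minor allele must be seen at least this often
FLANK = 15             # bases on either side that must match the consensus
PAD = " "              # assumed padding outside a read's aligned span


def passes_n_filter(read):
    """Keep a read only if its fraction of N bases is below MAX_N_FREQ."""
    bases = read.replace(PAD, "")
    return bool(bases) and bases.upper().count("N") / len(bases) < MAX_N_FREQ


def trim_start(read):
    """Blank out the first TRIM aligned bases of a padded read."""
    start = len(read) - len(read.lstrip(PAD))
    end = min(start + TRIM, len(read))
    return read[:start] + PAD * (end - start) + read[end:]


def consensus(reads):
    """Majority base per column (assumption: simple majority vote)."""
    cons = []
    for column in zip(*reads):
        counts = Counter(b for b in column if b not in (PAD, "N"))
        cons.append(counts.most_common(1)[0][0] if counts else "N")
    return "".join(cons)


def predict_snps(reads):
    """Return (position, major, minor, minor_count, depth) per candidate SNP."""
    reads = [trim_start(r.upper()) for r in reads if passes_n_filter(r)]
    if not reads:
        return []
    cons = consensus(reads)
    snps = []
    for pos in range(FLANK, len(cons) - FLANK):
        lo, hi = pos - FLANK, pos + FLANK + 1
        counts = Counter()
        for r in reads:
            if r[pos] in (PAD, "N", "-"):
                continue
            # Count the read only if both flanks match the consensus exactly.
            if r[lo:pos] == cons[lo:pos] and r[pos + 1:hi] == cons[pos + 1:hi]:
                counts[r[pos]] += 1
        if len(counts) < 2:
            continue
        depth = sum(counts.values())
        (major, _), (minor, minor_n) = counts.most_common(2)
        if minor_n >= MIN_MINOR_COUNT and minor_n / depth > MIN_MINOR_FREQ:
            snps.append((pos, major, minor, minor_n, depth))
    return snps


# Toy contig: six reference reads and four reads carrying a T at column 30.
reference = "ACGT" * 15 + "A"
variant = reference[:30] + "T" + reference[31:]
print(predict_snps([reference] * 6 + [variant] * 4))
```

Requiring clean flanks in each read is one conservative way to avoid calling SNPs in poorly aligned or low-quality regions when no chromatogram quality scores are available; a real pipeline would also need to handle gaps and reads of differing orientation, which this sketch does not attempt.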

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Expressed Sequence Tags
  • Penaeidae / genetics*
  • Polymorphism, Single Nucleotide*
  • Software*