Optimizing de novo transcriptome assembly and extending genomic resources for striped catfish (Pangasianodon hypophthalmus)

Mar Genomics. 2015 Oct:23:87-97. doi: 10.1016/j.margen.2015.05.001. Epub 2015 May 12.

Abstract

Striped catfish (Pangasianodon hypophthalmus) is a commercially important freshwater fish used in inland aquaculture in the Mekong Delta, Vietnam. The culture industry is facing a significant challenge however from saltwater intrusion into many low topographical coastal provinces across the Mekong Delta as a result of predicted climate change impacts. Developing genomic resources for this species can facilitate the production of improved culture lines that can withstand raised salinity conditions, and so we have applied high-throughput Ion Torrent sequencing of transcriptome libraries from six target osmoregulatory organs from striped catfish as a genomic resource for use in future selection strategies. We obtained 12,177,770 reads after trimming and processing with an average length of 97bp. De novo assemblies were generated using CLC Genomic Workbench, Trinity and Velvet/Oases with the best overall contig performance resulting from the CLC assembly. De novo assembly using CLC yielded 66,451 contigs with an average length of 478bp and N50 length of 506bp. A total of 37,969 contigs (57%) possessed significant similarity with proteins in the non-redundant database. Comparative analyses revealed that a significant number of contigs matched sequences reported in other teleost fishes, ranging in similarity from 45.2% with Atlantic cod to 52% with zebrafish. In addition, 28,879 simple sequence repeats (SSRs) and 55,721 single nucleotide polymorphisms (SNPs) were detected in the striped catfish transcriptome. The sequence collection generated in the current study represents the most comprehensive genomic resource for P. hypophthalmus available to date. Our results illustrate the utility of next-generation sequencing as an efficient tool for constructing a large genomic database for marker development in non-model species.

Keywords: Ion Torrent; Pangasianodon hypophthalmus; Salinity tolerance; Simple sequence repeat; Single nucleotide polymorphism; Transcriptome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Catfishes / genetics*
  • Expressed Sequence Tags
  • Fish Proteins / genetics
  • Fish Proteins / metabolism
  • Gene Expression Regulation / physiology
  • Genomics*
  • Polymorphism, Single Nucleotide
  • Protein Structure, Tertiary
  • Salinity
  • Transcriptome*

Substances

  • Fish Proteins