Comprehensive analysis of full-length transcriptomes of Schizothorax prenanti by single-molecule long-read sequencing

Genomics. 2022 Jan;114(1):456-464. doi: 10.1016/j.ygeno.2021.01.009. Epub 2021 Jan 28.

Abstract

Schizothorax prenanti (S. prenanti) is one of the most important aquaculture species in the southwest of China. However, information of the full-length transcripts in S. prenanti remains unknown. In this study, single-molecule real-time (SMRT) sequencing was performed to generate full-length transcriptomes of S.prenanti. In total, 23.26 Gb of clean reads were generated. A total of 312,587 circular consensus sequences (CCS) were obtained with average lengths of 2634 bp and 84.16% (270,662) of CCS were full-length non-chimeric reads. After being corrected with Illumina library sequencing, 18,005 contigs were obtained, with 17,797 (98.81%) successfully annotated in eight public databases, including 15,839 complete open reading frames (ORFs) with an average length of 1330 bp. Furthermore, a total of 4152 alternative splicing (AS) events and 250 long non-coding RNA (lncRNA) transcripts were detected. Additionally, a total of 1129 putative transcription factors (TFs) members from 56 TF families and 11,660 simple sequence repeats (SSRs) were identified. This study provided a valuable resource of full-length transcripts for further research on S. prenanti.

Keywords: Cyprinidae; Gene ontology; Long non-coding RNA; Microsatellite; RNA-seq.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Animals
  • Cyprinidae* / genetics
  • High-Throughput Nucleotide Sequencing
  • RNA, Long Noncoding* / genetics
  • Transcriptome*

Substances

  • RNA, Long Noncoding