nf-core/isoseq: simple gene and isoform annotation with PacBio Iso-Seq long-read sequencing

Bioinformatics. 2023 May 4;39(5):btad150. doi: 10.1093/bioinformatics/btad150.

Abstract

Motivation: Iso-Seq RNA long-read sequencing enables the identification of full-length transcripts and isoforms, removing the need for complex analysis such as transcriptome assembly. However, the raw sequencing data need to be processed in a series of steps before annotation is complete. Here, we present nf-core/isoseq, a pipeline for automatic read processing and genome annotation. Following nf-core guidelines, the pipeline has few dependencies and can be run on any of platforms.

Availability and implementation: The pipeline is freely available online on the nf-core website (https://nf-co.re/isoseq) and on GitHub (https://github.com/nf-core/isoseq) under MIT License (DOI: 10.5281/zenodo.7116979).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Genome*
  • Molecular Sequence Annotation
  • Protein Isoforms / genetics
  • Sequence Analysis, RNA
  • Transcriptome

Substances

  • Protein Isoforms