orfipy: a fast and flexible tool for extracting ORFs
- PMID: 33576786
- PMCID: PMC8479652
- DOI: 10.1093/bioinformatics/btab090
Abstract
Summary: Searching for open reading frames (ORFs) is a routine task and a critical step before annotating protein-coding regions in newly sequenced genomes or de novo transcriptome assemblies. With the rapid growth of genomic and transcriptomic data, faster tools are needed to handle large input datasets. These tools should be versatile enough to allow fine-tuning of search criteria and efficient downstream analysis. Here we present a new Python-based tool, orfipy, which allows the user to flexibly search for open reading frames in genomic and transcriptomic sequences. The search is rapid and fully customizable, with a choice of FASTA and BED output formats.
Availability and implementation: orfipy is implemented in Python and is compatible with Python v3.6 and higher. Source code: https://github.com/urmi-21/orfipy. Installation: from source, or via PyPI (https://pypi.org/project/orfipy) or Bioconda (https://anaconda.org/bioconda/orfipy).
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2021. Published by Oxford University Press.
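To make the task described in the Summary concrete, the following is a minimal pure-Python sketch of an ORF search over all six reading frames, with BED-style output. It is an illustration only and is not orfipy's implementation or API: the names `Orf`, `find_orfs`, and `to_bed` are hypothetical, and the sketch assumes ATG as the only start codon, the standard stop codons TAA/TAG/TGA, and ORFs reported from start codon through stop codon.

```python
from typing import Iterator, NamedTuple

# Assumptions for this sketch: ATG start, standard-code stop codons.
START_CODONS = {"ATG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


class Orf(NamedTuple):
    start: int   # 0-based, inclusive, in forward-strand coordinates
    end: int     # 0-based, exclusive (BED convention)
    strand: str  # "+" or "-"
    frame: int   # 0, 1 or 2, relative to the scanned strand
    seq: str     # nucleotides from start codon through stop codon


def reverse_complement(seq: str) -> str:
    return seq.translate(_COMPLEMENT)[::-1]


def _scan_strand(seq: str, min_len: int) -> Iterator[tuple]:
    """Yield (frame, start, end) for each start-to-stop ORF on one strand."""
    for frame in range(3):
        orf_start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if orf_start is None:
                if codon in START_CODONS:
                    orf_start = i          # first start codon opens the ORF
            elif codon in STOP_CODONS:
                end = i + 3                # include the stop codon
                if end - orf_start >= min_len:
                    yield frame, orf_start, end
                orf_start = None           # look for the next ORF in this frame


def find_orfs(seq: str, min_len: int = 30) -> Iterator[Orf]:
    """Search both strands; min_len is in nucleotides, stop codon included."""
    seq = seq.upper()
    n = len(seq)
    for frame, s, e in _scan_strand(seq, min_len):
        yield Orf(s, e, "+", frame, seq[s:e])
    rc = reverse_complement(seq)
    for frame, s, e in _scan_strand(rc, min_len):
        # Map reverse-strand positions back to forward-strand coordinates.
        yield Orf(n - e, n - s, "-", frame, rc[s:e])


def to_bed(seq_id: str, orfs) -> str:
    """Format ORFs as 6-column BED lines (chrom, start, end, name, score, strand)."""
    return "\n".join(
        f"{seq_id}\t{o.start}\t{o.end}\tORF{i}\t0\t{o.strand}"
        for i, o in enumerate(orfs, 1)
    )


if __name__ == "__main__":
    hits = list(find_orfs("CCATGAAATTTGGGTAACC", min_len=9))
    print(to_bed("seq1", hits))
```

A production tool would additionally need to handle ambiguous bases, alternative translation tables, partial ORFs at sequence ends, and multi-sequence FASTA input; this sketch leaves all of those out for brevity.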
