Geno2proteo, a Tool for Batch Retrieval of DNA and Protein Sequences from Any Genomic or Protein Regions

J Integr Bioinform. 2019 Jul 13;16(3):20180090. doi: 10.1515/jib-2018-0090.


The interconversion of sequences that constitute the genome and the proteome is becoming increasingly important due to the generation of large amounts of DNA sequence data. Following mapping of DNA segments to the genome, one fundamentally important task is to find the amino acid sequences which are coded within a list of genomic sections. Conversely, given a series of protein segments, an important task is to find the genomic loci which code for a list of protein regions. To perform these tasks on a region by region basis is extremely laborious when a large number of regions are being studied. We have therefore implemented an R package geno2proteo which performs the two mapping tasks and subsequent sequence retrieval in a batch fashion. In order to make the tool more accessible to users, we have created a web interface of the R package which allows the users to perform the mapping tasks by going to the web page and using the web service.

Keywords: DNA sequences; Genomics; Mapping between genome and proteome; Protein sequences; Proteomics.

MeSH terms

  • Databases, Nucleic Acid*
  • Genome*
  • Proteins / genetics*
  • Sequence Analysis, DNA*
  • Sequence Analysis, Protein*
  • Software*


  • Proteins