Pro-Frame: similarity-based gene recognition in eukaryotic DNA sequences with errors

Bioinformatics. 2001 Jan;17(1):13-5. doi: 10.1093/bioinformatics/17.1.13.

Abstract

Performance of existing algorithms for similarity-based gene recognition in eukaryotes drops when the genomic DNA has been sequenced with errors. A modification of the spliced alignment algorithm allows for gene recognition in sequences with errors, in particular frameshifts. It tolerates up to 5% of sequencing errors without considerable drop of prediction reliability when a sufficiently close homologous protein is available (normalized evolutionary distance similarity score 50% or higher).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computational Biology
  • Frameshifting, Ribosomal
  • Humans
  • Proteins / genetics
  • Sequence Alignment / methods*
  • Sequence Alignment / statistics & numerical data
  • Sequence Analysis, DNA / methods*
  • Sequence Analysis, DNA / statistics & numerical data
  • Software

Substances

  • Proteins