Development of joint application strategies for two microbial gene finders

Bioinformatics. 2004 Jul 10;20(10):1622-31. doi: 10.1093/bioinformatics/bth137. Epub 2004 Feb 26.

Abstract

Motivation: As a starting point in annotation of bacterial genomes, gene finding programs are used for the prediction of functional elements in the DNA sequence. Due to the faster pace and increasing number of genome projects currently underway, it is becoming especially important to have performant methods for this task.

Results: This study describes the development of joint application strategies that combine the strengths of two microbial gene finders to improve the overall gene finding performance. Critica is very specific in the detection of similarity-supported genes as it uses a comparative sequence analysis-based approach. Glimmer employs a very sophisticated model of genomic sequence properties and is sensitive also in the detection of organism-specific genes. Based on a data set of 113 microbial genome sequences, we optimized a combined application approach using different parameters with relevance to the gene finding problem. This results in a significant improvement in specificity while there is similarity in sensitivity to Glimmer. The improvement is especially pronounced for GC rich genomes. The method is currently being applied for the annotation of several microbial genomes.

Availability: The methods described have been implemented within the gene prediction component of the GenDB genome annotation system.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms*
  • DNA, Bacterial / analysis
  • DNA, Bacterial / chemistry*
  • DNA, Bacterial / genetics*
  • Gene Expression Profiling / methods*
  • Genome, Bacterial*
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*

Substances

  • DNA, Bacterial