Automated system for gene annotation and metabolic pathway reconstruction using general sequence databases

Chem Biodivers. 2007 Nov;4(11):2593-602. doi: 10.1002/cbdv.200790212.

Abstract

Despite the growing number of genomes published or currently being sequenced, there is a relative paucity of software for functional classification of newly discovered genes and their assignment to metabolic pathways. Available software for such analyses has a very steep learning curve and requires the installation, configuration, and maintenance of large amounts of complex infrastructure, including complementary software and databases. Many such tools are restricted to one or a few data sources and classification schemes. In this work, we report an automated system for gene annotation and metabolic pathway reconstruction (ASGARD), which was designed to be powerful and generalizable, yet simple for the biologist to install and run on centralized, commonly available computers. It avoids the requirement for complex resources such as relational databases and web servers, as well as the need for administrator access to the operating system. Our methodology contributes to a more rapid investigation of the potential biochemical capabilities of genes and genomes by the biological researcher, and is useful in biochemical as well as comparative and evolutionary studies of pathways and networks.

Publication types

  • Comparative Study

MeSH terms

  • Base Sequence / genetics
  • Databases, Genetic / classification*
  • Genome / genetics
  • Metabolic Networks and Pathways / genetics*
  • Pattern Recognition, Automated / methods*
  • Sequence Analysis, Protein / methods
  • Software