MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level

BMC Bioinformatics. 2008 Nov 27;9:498. doi: 10.1186/1471-2105-9-498.

Abstract

Background: The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons.

Description: Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons.

Conclusion: The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic.

MeSH terms

  • Algorithms
  • Bacteria / genetics*
  • Computational Biology / methods*
  • Conserved Sequence / genetics
  • Database Management Systems*
  • Databases, Genetic*
  • Genome, Bacterial*
  • Genomics / methods*
  • Internet
  • User-Computer Interface