Comparative Genome Analysis and Gene Finding in Candida Species Using CGOB

Mol Biol Evol. 2013 Jun;30(6):1281-91. doi: 10.1093/molbev/mst042. Epub 2013 Mar 13.

Abstract

The Candida Gene Order Browser (CGOB) was developed as a tool to visualize and analyze synteny relationships in multiple Candida species, and to provide an accurate, manually curated set of orthologous Candida genes for evolutionary analyses. Here, we describe major improvements to CGOB. The underlying structure of the database has been changed significantly. Genomic features are now based directly on genome annotations rather than on protein sequences, which allows non-protein features such as centromere locations in Candida albicans and tRNA genes in all species to be included. The data set has been expanded to 13 species, including genomes of pathogens (C. albicans, C. parapsilosis, C. tropicalis, and C. orthopsilosis), and those of xylose-degrading species with important biotechnological applications (C. tenuis, Scheffersomyces stipitis, and Spathaspora passalidarum). Updated annotations of C. parapsilosis, C. dubliniensis, and Debaryomyces hansenii have been incorporated. We discovered more than 1,500 previously unannotated genes among the 13 genomes, ranging in size from 29 to 3,850 amino acids. Poorly conserved and rapidly evolving genes were also identified. Re-analysis of the mating type loci of the xylose degraders suggests that C. tenuis is heterothallic, whereas both Spa. passalidarum and S. stipitis are homothallic. As well as hosting the browser, the CGOB website (http://cgob.ucd.ie) gives direct access to all the underlying genome annotations, sequences, and curated orthology data.

Keywords: CGOB; Candida; comparative genomics; gene order; pathogens; synteny; xylose; yeast.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Candida / classification
  • Candida / genetics*
  • Databases, Genetic*
  • Genes, Fungal*
  • Genome, Fungal*
  • Genomics / methods*
  • Models, Theoretical
  • Molecular Sequence Annotation
  • Molecular Sequence Data
  • Phylogeny
  • Sequence Alignment
  • Software*
  • User-Computer Interface