Database and Informatic Challenges in Representing Both Diploid and Tetraploid Xenopus Species in Xenbase

Cytogenet Genome Res. 2015;145(3-4):278-82. doi: 10.1159/000430427. Epub 2015 May 27.

Abstract

At the heart of databases is a data model referred to as a schema. Relational databases store information in tables, and the schema defines the tables and provides a map of relationships that show how the different table/data types relate to one another. In Xenbase, we were tasked to represent genomic, molecular, and biological data of both a diploid and tetraploid Xenopus species. When the database model was built over a decade ago, we had very little information on the nature of the X.laevis tetraploidization, but a Chado-based data model was proposed that could deal with the various forms of data in both species. Once the X.laevis genome was sequenced and annotated, it became clear that the data schema is very like the evolutionary schema that resulted in the X.laevis genome.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • DNA, Complementary / genetics
  • Database Management Systems*
  • Databases, Genetic*
  • Diploidy*
  • Gene Expression
  • Genome
  • Models, Theoretical
  • Molecular Sequence Annotation
  • Species Specificity
  • Tetraploidy*
  • Vocabulary, Controlled
  • Xenopus / genetics*
  • Xenopus laevis / genetics

Substances

  • DNA, Complementary