Simple re-instantiation of small databases using cloud computing

BMC Genomics. 2013;14 Suppl 5(Suppl 5):S13. doi: 10.1186/1471-2164-14-S5-S13. Epub 2013 Oct 16.

Abstract

Background: Small bioinformatics databases, unlike institutionally funded large databases, are vulnerable to discontinuation and many reported in publications are no longer accessible. This leads to irreproducible scientific work and redundant effort, impeding the pace of scientific progress.

Results: We describe a Web-accessible system, available online at http://biodb100.apbionet.org, for archival and future on demand re-instantiation of small databases within minutes. Depositors can rebuild their databases by downloading a Linux live operating system (http://www.bioslax.com), preinstalled with bioinformatics and UNIX tools. The database and its dependencies can be compressed into an ".lzm" file for deposition. End-users can search for archived databases and activate them on dynamically re-instantiated BioSlax instances, run as virtual machines over the two popular full virtualization standard cloud-computing platforms, Xen Hypervisor or vSphere. The system is adaptable to increasing demand for disk storage or computational load and allows database developers to use the re-instantiated databases for integration and development of new databases.

Conclusions: Herein, we demonstrate that a relatively inexpensive solution can be implemented for archival of bioinformatics databases and their rapid re-instantiation should the live databases disappear.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archives
  • Computational Biology / methods*
  • Databases, Factual*
  • Internet*
  • Software
  • User-Computer Interface