Alignment-based Approach for Durable Data Storage Into Living Organisms

Biotechnol Prog. Mar-Apr 2007;23(2):501-5. doi: 10.1021/bp060261y. Epub 2007 Jan 25.


The practical realization of DNA data storage is a major scientific goal. Here we introduce a simple, flexible, and robust data storage and retrieval method based on sequence alignment of the genomic DNA of living organisms. Duplicated data, encoded by different oligonucleotide sequences, were inserted redundantly into multiple loci of the Bacillus subtilis genome. Multiple alignment of the bit data sequences decoded from the B. subtilis genome sequences enabled the retrieval of stable and compact data without the need for template DNA, parity checks, or error-correcting algorithms. The practical use of this alignment-based method is discussed together with a computational simulation of data retrieval from mutated message DNA.
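
The retrieval idea can be illustrated with a minimal sketch. It is not the encoding used in the paper, which the abstract does not specify. The three codebooks, the 2-bits-per-base mapping, and the function names are all hypothetical. The sketch also assumes substitution-only mutations, so the decoded copies stay in register and a position-wise majority vote can stand in for a true multiple alignment.

```python
from collections import Counter

# Hypothetical codebooks: each maps a 2-bit pair to one base, with a
# different assignment per copy, so the same data is stored as
# different DNA sequences (an assumption, not the paper's scheme).
CODEBOOKS = [
    {"00": "A", "01": "C", "10": "G", "11": "T"},
    {"00": "T", "01": "G", "10": "C", "11": "A"},
    {"00": "C", "01": "A", "10": "T", "11": "G"},
]


def encode(bits, codebook):
    """Translate an even-length bit string into DNA, 2 bits per base."""
    if len(bits) % 2:
        raise ValueError("bit string length must be even")
    return "".join(codebook[bits[i:i + 2]] for i in range(0, len(bits), 2))


def decode(dna, codebook):
    """Translate DNA back into a bit string using the inverse codebook."""
    inverse = {base: pair for pair, base in codebook.items()}
    return "".join(inverse[base] for base in dna)


def substitute(dna, position, base):
    """Return a copy of dna with a point substitution at position."""
    return dna[:position] + base + dna[position + 1:]


def consensus(bit_strings):
    """Column-wise majority vote over equal-length decoded bit strings."""
    if len({len(s) for s in bit_strings}) != 1:
        raise ValueError("decoded copies must have equal length")
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*bit_strings)
    )


# Store one message as three differently encoded copies.
message = "0110100001101001"
copies = [encode(message, cb) for cb in CODEBOOKS]

# Simulate independent point mutations in two of the copies.
copies[0] = substitute(copies[0], 2, "T")
copies[1] = substitute(copies[1], 5, "A")

# Decode each copy with its own codebook, then take the consensus.
decoded = [decode(dna, cb) for dna, cb in zip(copies, CODEBOOKS)]
recovered = consensus(decoded)
assert recovered == message
```

With three copies and mutations at different positions, each bit column still has at least two correct votes, so the consensus matches the stored message even though two individual copies decode incorrectly. Insertions or deletions would shift the copies out of register and require a real alignment step, which this sketch does not attempt.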

MeSH terms

  • Base Sequence
  • Computer Simulation
  • Computers, Molecular*
  • DNA Mutational Analysis / methods
  • DNA, Bacterial / chemistry*
  • DNA, Bacterial / genetics*
  • Genetic Code*
  • Genome, Bacterial / genetics
  • Information Storage and Retrieval / methods*
  • Models, Chemical
  • Models, Genetic
  • Molecular Sequence Data
  • Sequence Alignment / methods*
  • Signal Processing, Computer-Assisted*


Substances

  • DNA, Bacterial