Random access DNA memory using Boolean search in an archival file storage system

Nat Mater. 2021 Sep;20(9):1272-1280. doi: 10.1038/s41563-021-01021-3. Epub 2021 Jun 10.

Abstract

DNA is an ultrahigh-density storage medium that could meet exponentially growing worldwide demand for archival data storage if DNA synthesis costs declined sufficiently and if random access of files within exabyte-to-yottabyte-scale DNA data pools were feasible. Here, we demonstrate a path to overcome the second barrier by encapsulating data-encoding DNA file sequences within impervious silica capsules that are surface labelled with single-stranded DNA barcodes. Barcodes are chosen to represent file metadata, enabling selection of sets of files with Boolean logic directly, without use of amplification. We demonstrate random access of image files from a prototypical 2-kilobyte image database using fluorescence sorting with selection sensitivity of one in 106 files, which thereby enables one in 106N selection capability using N optical channels. Our strategy thereby offers a scalable concept for random access of archival files in large-scale molecular datasets.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Archives
  • DNA / chemistry*
  • Fluorescence
  • Information Storage and Retrieval*
  • Plasmids
  • Polymerase Chain Reaction
  • Silicon Dioxide / chemistry
  • Synthetic Biology

Substances

  • Silicon Dioxide
  • DNA