2017
DOI: 10.1101/114553
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Scaling up DNA data storage and random access retrieval

Abstract: Current storage technologies can no longer keep pace with exponentially growing amounts of data.1 Synthetic DNA offers an attractive alternative due to its potential information density of ~ 10 18 B/mm 3 , 10 7 times denser than magnetic tape, and potential durability of thousands of years.2 Recent advances in DNA data storage have highlighted technical challenges, in particular, coding and random access, but have stored only modest amounts of data in synthetic DNA. 3,4,5 This paper demonstrates an end-to-end … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
47
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 34 publications
(48 citation statements)
references
References 12 publications
0
47
0
1
Order By: Relevance
“…We denote by s the maximum number of sequences that are never drawn (or their clusters are not identified), by t the maximum number of sequences, which have been reconstructed with errors with a maximum of ǫ errors of type E each. Typical error types E after the reconstruction step are insertions, deletions and substitutions, where the latter two are the most prominent ones in DNA storage systems [5]. To be more precise, we define the error balls associated with the channel model.…”
Section: B Dna Channel Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…We denote by s the maximum number of sequences that are never drawn (or their clusters are not identified), by t the maximum number of sequences, which have been reconstructed with errors with a maximum of ǫ errors of type E each. Typical error types E after the reconstruction step are insertions, deletions and substitutions, where the latter two are the most prominent ones in DNA storage systems [5]. To be more precise, we define the error balls associated with the channel model.…”
Section: B Dna Channel Modelmentioning
confidence: 99%
“…One way to address this problem is using block addresses, also called indices, that are stored as part of the strand. Errors in DNA are typically substitutions, insertions, and deletions, where most published studies report that either substitutions or deletions are the most prominent ones, depending upon the specific technology for synthesis and sequencing [2], [3], [4], [5], [6], [7]. For example, in column-based DNA oligo synthesis the dominant errors are deletions that result from either failure to remove the dimethoxytrityl (DMT) or combined inefficiencies in the coupling and capping steps [4].…”
Section: Introductionmentioning
confidence: 99%
“…However, as mentioned, such a problem can be easily resolved using data storage on DNA-carriers. Using synthetic DNA for the storage of numerical information makes it possible to achieve potential information density of 10 9 Gb/mm 3 at the potential durability of thousands of years (6). Thus, the researchers of Microsoft Corporation (Microsoft Research) in co-operation with scientists from University of Washington have stored more than 200 Mb numerical data in the form of DNA (4,6,7,8,9).…”
Section: Introductionmentioning
confidence: 99%
“…Using synthetic DNA for the storage of numerical information makes it possible to achieve potential information density of 10 9 Gb/mm 3 at the potential durability of thousands of years (6). Thus, the researchers of Microsoft Corporation (Microsoft Research) in co-operation with scientists from University of Washington have stored more than 200 Mb numerical data in the form of DNA (4,6,7,8,9). In particular, these data contain the encoding of high definition video (10), copies of the Universal Declaration of Human Rights in different languages, the top 100 books from Project Gutenberg, and the Crop Trust seed database (6).…”
Section: Introductionmentioning
confidence: 99%
“…Since then, several more groups have demonstrated the ability to successfully store data of large scale using DNA molecules; see e.g. [1], [2], [5], [13], [18]. Other works developed coding solutions which are specifically targeted to correct the special types of errors inside DNA-based storage systems [10]- [12], [14], [16]- [18].…”
Section: Introductionmentioning
confidence: 99%