2019 IEEE International Symposium on Information Theory (ISIT) 2019
DOI: 10.1109/isit.2019.8849789

Capacity Results for the Noisy Shuffling Channel

Abstract: Motivated by DNA-based storage, we study the noisy shuffling channel, which can be seen as the concatenation of a standard noisy channel (such as the BSC) and a shuffling channel, which breaks the data block into small pieces and shuffles them. This channel models a DNA storage system, by capturing two of its key aspects: (1) the data is written onto many short DNA molecules that are stored in an unordered way and (2) the molecules are corrupted by noise at synthesis, sequencing, and during storage. For the BS…
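The channel described in the abstract can be sketched as a small simulation. This is only an illustration under stated assumptions, not the paper's construction: the function names, the fixed piece length `piece_len`, and the `seed` parameter are hypothetical, and the noise is applied before the shuffle, following the order given in the abstract.

```python
import random

def bsc(bits, p, rng):
    """Binary symmetric channel: flip each bit independently with probability p."""
    return [b ^ (rng.random() < p) for b in bits]

def shuffle_pieces(bits, piece_len, rng):
    """Break the block into pieces of piece_len bits and return them in random order."""
    pieces = [tuple(bits[i:i + piece_len]) for i in range(0, len(bits), piece_len)]
    rng.shuffle(pieces)  # the receiver loses the original order of the pieces
    return pieces

def noisy_shuffling_channel(bits, piece_len, p, seed=None):
    """Concatenate a BSC with crossover probability p and a shuffling channel (assumed order)."""
    rng = random.Random(seed)
    return shuffle_pieces(bsc(bits, p, rng), piece_len, rng)

if __name__ == "__main__":
    block = [0, 1, 1, 0, 1, 0, 0, 1, 1, 1, 0, 0]
    print(noisy_shuffling_channel(block, piece_len=4, p=0.1, seed=0))
```

Because the output is an unordered collection of pieces, a decoder in this sketch would have to recover both the flipped bits and the original piece order, which is the difficulty the abstract points to.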

Cited by 55 publications (40 citation statements)
References: 22 publications
“…. storage," the authors of [17] studied the storage capacity of a model where DNA codewords first experienced the deleterious effects of a DMC (e.g., a binary symmetric channel), and were then fragmented, and subsequently, the fragments were randomly permuted. (Unlike [15], the receiver had access to all the permuted fragments in this model for simplicity.)…”
Section: A Related Literature and Motivation (mentioning)
confidence: 99%
“…While previous works have proposed several coding schemes, there has been little understanding of the optimal tradeoff between writing cost (bases synthesized/information bit) and reading cost (bases sequenced/information bit). Recent work in [15] and [16] studied the information-theoretic capacity of a DNA-based storage channel, however the work has limited practical applicability due to various unrealistic assumptions and the asymptotic nature of their results.…”
Section: Previous Work (mentioning)
confidence: 99%
“…In this section, we consider a simplified model for DNA-based storage to develop a better understanding of the coding theoretic tradeoffs. While several previous works such as [15], [16], [18] theoretically analyze various aspects of the DNA-based storage problem (such as the information-theoretic capacity in the asymptotic setting and the optimality of various techniques to recover the order of the oligonucleotides), our main focus is to understand the tradeoff between the writing and reading cost associated with DNA-based storage and to motivate the scheme described in Section 3.…”
Section: Theoretical Analysis (mentioning)
confidence: 99%
“…However, codewords there consist of multisets of symbols in some alphabet, and errors are comprised of deletions, insertions, substitutions, and erasures of symbols in the multiset. An information-theoretic treatment of related but abstracted models of DNA-based data storage may be found in [32,33]. Very recently, a model for clustering sequencing outputs according to the relevant DNA strand and codes that allow for correct clustering have been studied in [34].…”
Section: Introduction (mentioning)
confidence: 99%