2021
DOI: 10.48550/arxiv.2109.12549
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The DNA Storage Channel: Capacity and Error Probability

Abstract: The DNA storage channel is considered, in which M Deoxyribonucleic acid (DNA) molecules comprising each codeword, are stored without order, then sampled N times with replacement, and the sequenced over a discrete memoryless channel. For a constant coverage depth M/N and molecule length scaling Θ(log M ), lower (achievability) and upper (converse) bounds on the capacity of the channel, as well as a lower (achievability) bound on the reliability function of the channel are provided. Both the lower and upper boun… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 34 publications
(156 reference statements)
0
6
0
Order By: Relevance
“…The capacity of multi-draw DNA storage channels where the noisy channel p(y|x) is an arbitrary discrete memoryless channel recently received a careful treatment by Weinberger and Merhav [WM21]. The paper provides a new lower bound for general multi-draw DNA storage channels that is based on a random coding argument and holds for all values of β > 1, unlike the achievability arguments for Theorems 6 and 7, which only cover specific discrete memoryless channels.…”
Section: General Discrete Memoryless Channelsmentioning
confidence: 99%
See 1 more Smart Citation
“…The capacity of multi-draw DNA storage channels where the noisy channel p(y|x) is an arbitrary discrete memoryless channel recently received a careful treatment by Weinberger and Merhav [WM21]. The paper provides a new lower bound for general multi-draw DNA storage channels that is based on a random coding argument and holds for all values of β > 1, unlike the achievability arguments for Theorems 6 and 7, which only cover specific discrete memoryless channels.…”
Section: General Discrete Memoryless Channelsmentioning
confidence: 99%
“…as long as β > β cr , where β cr can be explicitly computed [WM21], and C n is the capacity of the modulo-additive channel with n draws. For the special case X = {0, 1}, the modulo-additive channel reduces to the BSC, and the result in Theorem 7 is recovered, but with a weaker requirement on p and β.…”
Section: General Discrete Memoryless Channelsmentioning
confidence: 99%
“…Multiple channel models have recently been suggested and studied based on this property. An assumption of overlap in read substrings and (near) uniform coverage leads to the problem of string reconstruction from substring composition [3], [4], [7], [13], [21], [22], [27], [29]; on the contrary, assuming no overlap in read substrings leads to the torn-paper problem [23], [25], [30], a problem closely related to the shuffling channel [16], [17], [28], [32]. This problem is motivated by DNA-based storage systems, where the information is stored in synthesized strands of DNA molecules.…”
Section: Introductionmentioning
confidence: 99%
“…Notably, when observations are composed of consecutive substrings, the reconstruction from substring-compositions problem [1], [1], [4], [10], [12], [17], [19], [20], [25], [27], [31], [32] and the torn-paper problem [2], [21], [22], [28] (a problem closely related to the shuffling channel [11], [13], [26], [30]) have received significant interest in the past decade due to applications in DNA-or polymer-based storage systems, resulting from contemporary sequencing technologies [4], [9], [20]. The former arises from an idealized assumption of full overlap (and uniform coverage) in read substrings, while the latter results from an assumption of no overlap; in applications, this models the question of whether the complete information string may be replicated and uniformly segmented for sequencing, or if segmentation occurs adversarially in the medium prior to sequencing.…”
Section: Introductionmentioning
confidence: 99%