2019 IEEE International Symposium on Information Theory (ISIT) 2019
DOI: 10.1109/isit.2019.8849737
|View full text |Cite
|
Sign up to set email alerts
|

Clustering-Correcting Codes

Abstract: A new family of codes, called clustering-correcting codes, is presented in this paper. This family of codes is motivated by the special structure of data that is stored in DNA-based storage systems. The data stored in these systems has the form of unordered sequences, also called strands, and every strand is synthesized thousands to millions of times, where some of these copies are read back during sequencing. Due to the unordered structure of the strands, an important task in the decoding process is to place … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 24 publications
0
10
0
Order By: Relevance
“…The problem of clustering output strings was studied from a coding-theoretic standpoint in [15]. It was shown that "codeaware" clustering, i.e., a clustering algorithm that exploits…”
Section: A Related Literaturementioning
confidence: 99%
“…The problem of clustering output strings was studied from a coding-theoretic standpoint in [15]. It was shown that "codeaware" clustering, i.e., a clustering algorithm that exploits…”
Section: A Related Literaturementioning
confidence: 99%
“…is w-subsequenceunique and simultaneously satisfies Properties 23 and 39 is found. Such a string z is guaranteed to exist because all such properties hold for x ′ + g(U t ) with probability 1 − o(1) (see Lemmas 22,34,and 38). Moreover, whether x ′ + g(z) satisfies all three properties can be checked in time poly(m);…”
Section: Using the Code Within A Marker-based Constructionmentioning
confidence: 99%
“…An information-theoretic treatment of related but abstracted models of DNA-based data storage may be found in [32,33]. Very recently, a model for clustering sequencing outputs according to the relevant DNA strand and codes that allow for correct clustering have been studied in [34].…”
Section: Introductionmentioning
confidence: 99%
“…The redundancy required to force such a constraint on a collection of vectors will be calculated later. For the case of t = 0, the set A(l, 0, ǫ 1 , ǫ 2 ) is called clustering-correcting code, and explicit constructions which require only one bit of redundancy and can be encoded and decoded efficiently can be found in [14]. The anchoring property will be used to reconstruct the ordering of the sequences.…”
Section: Constructionmentioning
confidence: 99%