2020
DOI: 10.1016/j.tcs.2019.08.012

Approximate pattern matching on elastic-degenerate text

Abstract: An elastic-degenerate string is a sequence of n sets of strings of total length N. It has been introduced to represent a multiple alignment of several closely-related sequences (e.g., pan-genome) compactly. In this representation, substrings of these sequences that match exactly are collapsed, while in positions where the sequences differ, all possible variants observed at that location are listed. The natural problem that arises is finding all matches of a deterministic pattern of length m in an elastic-degen…
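
To make the definition in the abstract concrete, the following is a minimal Python sketch of an elastic-degenerate (ED) text as a list of sets of strings, together with a brute-force matcher. It is not the algorithm of the paper: it enumerates every plain string the ED text can spell, which is exponential in n and only workable for toy inputs. The example text `ed_text`, the function names, the mismatch budget `k` (Hamming distance), and the convention that an empty variant contributes 0 to N are all assumptions made for illustration.

```python
from itertools import product

# Hypothetical toy ED text: one set of variants per position.
# The empty string "" models a variant in which the segment is absent.
ed_text = [{"A"}, {"C", "CA", ""}, {"GT"}, {"A", "T"}]


def ed_sizes(ed):
    """Return (n, N): the number of sets and the total length of all strings.

    Assumption: an empty variant contributes 0 to N.
    """
    n = len(ed)
    total = sum(len(s) for variants in ed for s in variants)
    return n, total


def expansions(ed):
    """Yield every plain string obtained by picking one variant per set."""
    for choice in product(*(sorted(v) for v in ed)):
        yield "".join(choice)


def occurs(ed, pattern, k=0):
    """Brute force: True if some expansion contains a substring that
    differs from `pattern` in at most k positions (Hamming distance)."""
    m = len(pattern)
    for text in expansions(ed):
        for i in range(len(text) - m + 1):
            window = text[i:i + m]
            mismatches = sum(a != b for a, b in zip(pattern, window))
            if mismatches <= k:
                return True
    return False


if __name__ == "__main__":
    print(ed_sizes(ed_text))
    print(sorted(expansions(ed_text)))
    print(occurs(ed_text, "CAGT"), occurs(ed_text, "GG"), occurs(ed_text, "GG", k=1))
```

The sketch shows why the compact representation matters: four sets already spell six distinct strings, and the number of expansions grows multiplicatively with each additional set that lists several variants.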

Citations: cited by 18 publications (14 citation statements)
References: 22 publications
“…We leave it as open question if similar solutions can be derived for arbitrary DAGs, or e.g. for elastic degenerate strings [5]. Or more generally, can bit-parallel algorithms with more complex dependencies, like in the case of approximate pattern matching, be turned into quantum algorithms?…”
Section: Discussion (mentioning)
confidence: 99%
“…An elastic-degenerate string is a sequence of sets of strings used for succinctly representing a multiple alignment of a collection of closely related sequences (e.g. a pan-genome, that is all genes and genetic variation within a species), and also supports approximate pattern matching [4]. Sequence alignment is useful for inferring evolutionary relationships between biological sequences.…”
Section: Applications On Meta Strings (mentioning)
confidence: 99%
“…That is, any set of S does not contain, in general, only single letters, nor substrings of the same length, as it may contain strings of different lengths, including the empty string (see Figure 1). In a few recent papers, several algorithms for pattern matching on ED strings have been presented; specifically, for finding all exact [8,10,11,12,13] or approximate [14,15] occurrences of a standard string pattern in an ED text.…”
Section: And References Therein) (mentioning)
confidence: 99%