2020
DOI: 10.1016/j.ic.2019.104513
|View full text |Cite
|
Sign up to set email alerts
|

Streaming k-mismatch with error correcting and applications

Abstract: We present a new streaming algorithm for the k-Mismatch problem, one of the most basic problems in pattern matching. Given a pattern and a text, the task is to find all substrings of the text that are at the Hamming distance at most k from the pattern. Our algorithm is enhanced with an important new feature called Error Correcting, and its complexities for k = 1 and for a general k are comparable to those of the solutions for the k-Mismatch problem by Porat and Porat (FOCS 2009) and Clifford et al. (SODA 2016… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…We pay our attention to communication complexity of string problems where the inputs A and B are strings over an alphabet Σ. Communication complexity of string problems has played a critical role in the space lower bound analysis of several streaming processing problems including Hamming/edit/swap distances [2], pattern matching with k-mismatches [3], parameterized pattern matching [4], dictionary matching [5], and quasiperiodicity [6].…”
Section: Introductionmentioning
confidence: 99%
“…We pay our attention to communication complexity of string problems where the inputs A and B are strings over an alphabet Σ. Communication complexity of string problems has played a critical role in the space lower bound analysis of several streaming processing problems including Hamming/edit/swap distances [2], pattern matching with k-mismatches [3], parameterized pattern matching [4], dictionary matching [5], and quasiperiodicity [6].…”
Section: Introductionmentioning
confidence: 99%
“…In that model, the text arrives in a stream, one character at a time, and the goal is to compute, or estimate, after the arrival of each text character, the Hamming distance between P and the current suffix of T . The state-of-the-art exact algorithm [15] uses Õ(k) space and costs Õ( √ k) time per character, which improves upon [14,23,36,38]. A recent approximate streaming algorithm [12] uses Õ(min(ε −2 √ k, ε −1.5 √ n)) space and costs Õ(ε −3 ) time per character, which improves upon [16,39].…”
Section: Related Workmentioning
confidence: 99%
“…Related to our work is the problem of streaming pattern matching, where the goal is to find all occurrences of a pattern (possibly with some bounded number mismatches) in a data stream, see e.g. [65,80,56,53,19,20,21,22,23,57,59,58,78,84] and search of repetitions in streams [33,31,32,52,51,72,73].…”
Section: Regmentioning
confidence: 99%