2020
DOI: 10.48550/arxiv.2002.11342
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Asymmetric Streaming Algorithms for Edit Distance and LCS

Abstract: The edit distance (ED) and longest common subsequence (LCS) are two fundamental problems which quantify how similar two strings are to one another. In this paper, we consider these problems in the streaming model where one string is available via oracle queries and the other string comes as a stream of characters. Our main contribution is a constant factor approximation algorithm in this setting for ED with memory O(n δ ) for any δ > 0. In addition to this, we present an upper bound of Õ( √ n) on the memory ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
7
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(8 citation statements)
references
References 45 publications
1
7
0
Order By: Relevance
“…We then consider deterministic approximation of LCS. Here, the work of [GG10,EJ08] gives a lower bound of Ω 1 R n ε log |Σ| εn for any R pass streaming algorithm achieving a 1 + ε approximation of LIS, which also implies a lower bound of Ω 1 R n ε log 1 ε for asymmetric streaming LCS when |Σ| ≥ n. These bounds match the upper bound in [GJKK07] for LIS and LNS, and in [FHRS20,CJLZ20] for LCS. However, a major drawback of this bound is that it gives nothing when |Σ| is small (e.g., |Σ| ≤ εn).…”
Section: ω(supporting
confidence: 71%
See 3 more Smart Citations
“…We then consider deterministic approximation of LCS. Here, the work of [GG10,EJ08] gives a lower bound of Ω 1 R n ε log |Σ| εn for any R pass streaming algorithm achieving a 1 + ε approximation of LIS, which also implies a lower bound of Ω 1 R n ε log 1 ε for asymmetric streaming LCS when |Σ| ≥ n. These bounds match the upper bound in [GJKK07] for LIS and LNS, and in [FHRS20,CJLZ20] for LCS. However, a major drawback of this bound is that it gives nothing when |Σ| is small (e.g., |Σ| ≤ εn).…”
Section: ω(supporting
confidence: 71%
“…Second, because it is a relaxation of the standard streaming model, one can hope to design better algorithms for ED or to beat the strong lower bounds for LCS in this model. The latter point is indeed verified by two recent works [FHRS20,CJLZ20], which give a deterministic one pass algorithm achieving a O(2 1/δ ) approximation of ED, using space Õ(n δ /δ) and time Õδ (n 4 ) for any constant δ > 0, as well as deterministic one pass algorithms achieving 1 ± ε approximation of ED and LCS, using space Õ( √ n ε ) and time Õε (n 2 ). A natural question is how much we can improve these results.…”
Section: Introductionmentioning
confidence: 52%
See 2 more Smart Citations
“…In this streaming model, we have random access to one string and streaming access to the other. Recently, Farhadi et al [10] presented asymmetric streaming algorithms for edit distance and longest common subsequence. Notably, that work presents an algorithm with a constant factor approximation for edit distance with memory Õ(n δ ) for any δ > 0.…”
Section: Related Workmentioning
confidence: 99%