2011
DOI: 10.1007/978-3-642-21458-5_27
|View full text |Cite
|
Sign up to set email alerts
|

Faster Subsequence and Don’t-Care Pattern Matching on Compressed Texts

Abstract: Subsequence pattern matching problems on compressed text were first considered by Cégielski et al. (Window Subsequence Problems for Compressed Texts, Proc. CSR 2006, LNCS 3967, pp. 127-136), where the principal problem is: given a string T represented as a straight line program (SLP) T of size n, a string P of size m, compute the number of minimal subsequence occurrences of P in T. We present an O(nm) time algorithm for solving all variations of the problem introduced by Cégielski et al.. This improves the pre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 21 publications
(11 citation statements)
references
References 16 publications
0
11
0
Order By: Relevance
“…Let us call this problem the semi-compressed subsequence matching problem. There is a straightforward algorithm for semi-compressed subsequence matching that works in time O.jpj jT j/, see [132,142]. Assume that the SLP T is in Chomsky normal form.…”
Section: Inputmentioning
confidence: 99%
See 1 more Smart Citation
“…Let us call this problem the semi-compressed subsequence matching problem. There is a straightforward algorithm for semi-compressed subsequence matching that works in time O.jpj jT j/, see [132,142]. Assume that the SLP T is in Chomsky normal form.…”
Section: Inputmentioning
confidence: 99%
“…The values`.i; X / satisfy a simple recursion, which allows to compute all these values in time O.jpj jT j/. More complex problems related to semi-compressed subsequence matching (e.g., counting the number of shortest factors of eval.T /, which contain p as a subsequence) are considered in [22,133,142].…”
Section: Inputmentioning
confidence: 99%
“…Compression Type # of parameters Interpretability Optimization PCA-SL [14,9] Orthogonal rotation Lossy 2 Limited Stable bMH-SL [21] Hashing Lossy 3 Unable Stable SGD [33,8] Sampling -1 Limited Unstable cPLS (this study) Grammar compression Lossless 1 High Stable covered from grammar-compressed data) that also has a wide variety of applications in string processing, such as pattern matching [37], edit-distance computation [13], and q-gram mining [1]. Grammar compression builds a small context-free grammar that generates only the input data and is very effective at compressing sequences that contain many repeats.…”
Section: Approachmentioning
confidence: 99%
“…Representative methods are RePair [12] and LCA [17]. Methods for type (ii) have also been presented for processing repetitive texts, e.g., pattern matching [18], pattern mining [9] and edit distance computation [10]. Table 1.…”
Section: Introductionmentioning
confidence: 99%