2014
DOI: 10.1007/978-3-319-11918-2_18

Indexed Matching Statistics and Shortest Unique Substrings

Cited by 13 publications (20 citation statements)
References 22 publications
“…. This property allows all three of these vectors to be encoded in 2x bits, where x is the length of the corresponding input string [62,8].…”
Section: Matching Statistics (mentioning)
confidence: 99%
“…This computation can be done in O(|y|) time [2,24]. By applying Fact 1, we can answer any query y in O(|y|) time for…”
Section: Square-free-preserved Matching Statistics (mentioning)
confidence: 99%
“…Computing MS_{S,T} is a classical problem in string processing, and in practice it involves building an index on a fixed T to answer a large number of queries S. Thus, solutions typically differ on the index they use, which can be the textbook suffix tree, the compressed suffix tree [29] or compressed suffix array, the colored longest common prefix array [17], a Burrows-Wheeler index combined with the suffix tree topology [3,4], or the r-index combined with balanced grammars [6]. In the frequent case where T consists of one genome (or proteome), or of the concatenation of few similar genomes or of many dissimilar genomes, the Burrows-Wheeler transform of T does not compress well, and the best space-time tradeoffs are achieved by the implementation in [4] (see [6] for a runtime comparison, and see Figure 2 in the supplement for a memory comparison).…”
Section: Introduction (mentioning)
confidence: 99%
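
For readers unfamiliar with the quantity discussed in the statement above, the following is a minimal sketch of what a matching statistics vector contains. It assumes the standard definition, which the quoted text does not spell out: MS[i] is the length of the longest prefix of S[i..] that occurs somewhere in T. The function name `matching_statistics` is hypothetical, and plain substring search stands in for the indexes listed in the quote (suffix tree, Burrows-Wheeler index, r-index), so this illustrates the definition only, not any cited implementation.

```python
def matching_statistics(s: str, t: str):
    """Return MS where MS[i] is the length of the longest prefix of s[i:]
    that occurs as a substring of t (assumed standard definition).

    Naive sketch: Python's `in` operator replaces a real text index.
    """
    ms = []
    length = 0
    for i in range(len(s)):
        # If s[i-1 : i-1+length] occurs in t, then s[i : i-1+length] does too,
        # so the match at position i has length at least length - 1.
        length = max(length - 1, 0)
        # Extend the match one character at a time while it still occurs in t.
        while i + length < len(s) and s[i:i + length + 1] in t:
            length += 1
        ms.append(length)
    return ms


print(matching_statistics("abcab", "xabcx"))  # [3, 2, 1, 2, 1]
```

Reusing the previous match length relies on the fact that consecutive values can drop by at most one, MS[i] >= MS[i-1] - 1.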
“…Then, we implement fast range queries for computing the average and maximum matching statistic value inside a substring of S, taking advantage of the compact encoding of MS_{S,T} introduced by [3]: this encoding takes just 2|S| bits, and allows one to retrieve MS[i] in constant time for any i using just o(|S|) more bits. In some cases this bitvector is compressible, so our code can operate both on the plain encoding and on its compressed versions.…”
Section: Introduction (mentioning)
confidence: 99%
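
The 2|S|-bit encoding mentioned in the statement above can be illustrated with a small sketch. It assumes the usual construction, which is not spelled out in the quoted text: because MS[i] >= MS[i-1] - 1, one can write MS[i] - MS[i-1] + 1 zeros followed by a single one for each i (taking MS[-1] = 1), and then MS[i] equals the position of the i-th one minus 2i. The names `encode_ms`, `one_positions` and `decode_ms` are hypothetical, and the precomputed list of positions stands in for a succinct select structure, so the o(|S|)-bit overhead of the real encoding is not reproduced here.

```python
def encode_ms(ms):
    """Encode a matching statistics vector as a list of bits.

    For each i, emit MS[i] - MS[i-1] + 1 zeros and then a one,
    with the sentinel MS[-1] = 1 (an assumption of this sketch).
    """
    bits = []
    prev = 1
    for v in ms:
        zeros = v - prev + 1
        if zeros < 0:
            raise ValueError("input violates MS[i] >= MS[i-1] - 1")
        bits.extend([0] * zeros)
        bits.append(1)
        prev = v
    return bits


def one_positions(bits):
    """Positions of the ones; a stand-in for a succinct select structure."""
    return [p for p, b in enumerate(bits) if b == 1]


def decode_ms(ones, i):
    """Recover MS[i] from the position of the i-th one (0-indexed)."""
    return ones[i] - 2 * i


ms = [3, 2, 1, 2, 1]
bits = encode_ms(ms)
ones = one_positions(bits)
print(bits)                                          # [0, 0, 0, 1, 1, 1, 0, 0, 1, 1]
print([decode_ms(ones, i) for i in range(len(ms))])  # [3, 2, 1, 2, 1]
```

In this example the encoding uses 10 bits for |S| = 5. In general the number of ones is |S| and the number of zeros is MS[|S|-1] + |S| - 1, which is at most |S| because the last value is 0 or 1.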