2011
DOI: 10.1016/j.jda.2010.12.001
|View full text |Cite
|
Sign up to set email alerts
|

Position heaps: A simple and dynamic text indexing data structure

Abstract: We address the problem of finding the locations of all instances of a string P in a text T , where preprocessing of T is allowed in order to facilitate the queries. Previous data structures for this problem include the suffix tree, the suffix array, and the compact DAWG. We modify a data structure called a sequence tree, which was proposed by Coffman and Eve (1970) [3] for hashing, and adapt it to the new problem. We can then produce a list of k occurrences of any string P in T in O ( P + k) time. Because of p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
63
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 39 publications
(65 citation statements)
references
References 9 publications
(15 reference statements)
1
63
0
Order By: Relevance
“…On the other side in LSTs the two pointers (length and position in the text) associated with any arc (where the total number of arcs is bounded by 2n − 2) are replaced with one symbol. Anyhow, the LSTs cannot compete for instance, in terms of space efficiency, with some relatively new data structures, such as compressed suffix trees [24] and position heaps [10].…”
Section: Lemma 1 Let σ Be Any Alphabet and W Be A String Over σ Of Lmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other side in LSTs the two pointers (length and position in the text) associated with any arc (where the total number of arcs is bounded by 2n − 2) are replaced with one symbol. Anyhow, the LSTs cannot compete for instance, in terms of space efficiency, with some relatively new data structures, such as compressed suffix trees [24] and position heaps [10].…”
Section: Lemma 1 Let σ Be Any Alphabet and W Be A String Over σ Of Lmentioning
confidence: 99%
“…Patricias [22] posed this issue but required to know the length of the substring to be skipped during searching. Some related questions have been addressed for other variants of suffix trees, such as position heaps [10,23], where the ideas proposed are completely different from those in this paper as the underlying structure is a heap rather than a trie. Also, the trick of encoding the input string as a path and its substrings as subpaths 1 does not answer the question as conceptually the arcs of the suffix tree are still (implicitly) labeled with substrings.…”
Section: Introductionmentioning
confidence: 96%
“…Finally, in Section 7 we show that an entirely different data structure, the position heap of Ehrenfeucht et al [8], yields a completely different tradeoff for indexing a sparse set of positions. Position heaps are in a sense "easier" to compute than suffix trees or suffix arrays, since it is not necessary to sort the entire suffixes.…”
Section: Our Resultsmentioning
confidence: 99%
“…The position heap H T over a text T 1,n is a blend of a trie over T 's suffixes and a heap over its indices [8]:…”
Section: Position Heapsmentioning
confidence: 99%
See 1 more Smart Citation