2014
DOI: 10.1145/2590774
|View full text |Cite
|
Sign up to set email alerts
|

Space-Efficient Frameworks for Top- k String Retrieval

Abstract: The inverted index is the backbone of modern web search engines. For each word in a collection of web documents, the index records the list of documents where this word occurs. Given a set of query words, the job of a search engine is to output a ranked list of the most relevant documents containing the query. However, if the query consists of an arbitrary string-which can be a partial word, multiword phrase, or more generally any sequence of characters-then word boundaries are no longer relevant and we need a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 29 publications
(3 citation statements)
references
References 67 publications
0
3
0
Order By: Relevance
“…It is also worth mentioning that the journal version of the original paper of Hon et al has recently appeared as well [40]. Here they show how to obtain O(p + k) time if the top-k results are not to be returned sorted by relevance.…”
Section: Discussionmentioning
confidence: 90%
“…It is also worth mentioning that the journal version of the original paper of Hon et al has recently appeared as well [40]. Here they show how to obtain O(p + k) time if the top-k results are not to be returned sorted by relevance.…”
Section: Discussionmentioning
confidence: 90%
“…For example, categorical range counting queries (i.e., count the number of different values in a range) requires in general Ω(log n/ log log n) time if using O(n polylog n) space [11], where n is the array size, but if queries form a hierarchy it is easily solved in constant time and O(n) bits [13]. A second example is the range mode problem (i.e., find the most frequent value in a range), which is believed to require time Ω(n 1.188 ) if using O(n 1.188 ) space [4], but if queries form a hierarchy it is easily solved in constant time and linear space [8].…”
Section: Introductionmentioning
confidence: 99%
“…In this paper we aim at a compact data structure to represent data cubes where the domains in each dimension are hierarchical. Following the general idea of the tailored solutions to the problems we mentioned [13,8], our structure partitions the space according to the hierarchies, instead of performing a regular partition like generic multidimensional structures. Therefore, the queries of interest for OLAP applications, which combine nodes of the different hierarchies, will require aggregating the information of just a few nodes in our partitions, much fewer than if we used a generic space partitioning method.…”
Section: Introductionmentioning
confidence: 99%