Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval 2011
DOI: 10.1145/2009916.2009992
|View full text |Cite
|
Sign up to set email alerts
|

Inverted indexes for phrases and strings

Abstract: Inverted indexes are the most fundamental and widely used data structures in information retrieval. For each unique word occurring in a document collection, the inverted index stores a list of the documents in which this word occurs. Compression techniques are often applied to further reduce the space requirement of these lists. However, the index has a shortcoming, in that only predefined pattern queries can be supported efficiently. In terms of string documents where word boundaries are undefined, if we have… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0
4

Year Published

2014
2014
2017
2017

Publication Types

Select...
6
4

Relationship

2
8

Authors

Journals

citations
Cited by 43 publications
(29 citation statements)
references
References 47 publications
(35 reference statements)
0
24
0
4
Order By: Relevance
“…Contributions came from four continents and seven countries. At least six teams (Team 1, 3-5, 7-9) published highly influential papers on string matching problems before [15,19,22,24,26,28,33], while two teams (Team 2 and Team 6) can be considered as newcomers. As Table 1 shows, the techniques used cover a broad range and thus subsume a large fraction of previous research in k-approximate string matching.…”
Section: Competition and Methodologymentioning
confidence: 99%
“…Contributions came from four continents and seven countries. At least six teams (Team 1, 3-5, 7-9) published highly influential papers on string matching problems before [15,19,22,24,26,28,33], while two teams (Team 2 and Team 6) can be considered as newcomers. As Table 1 shows, the techniques used cover a broad range and thus subsume a large fraction of previous research in k-approximate string matching.…”
Section: Competition and Methodologymentioning
confidence: 99%
“…Their index, on the other hand, turns out to be very competitive in practice. There exist several other indexes of practical interest [22,6,23,24,25].…”
Section: Sourcementioning
confidence: 99%
“…We compare their best performing variant, GREEDY, in this paper. Culpepper et al [5] adapted the scheme to large natural language text collections (where each word is taken as an atomic symbol), showing that it was competitive with inverted indexes for some queries (see previous work on this line by Patil et al [20]). The seminal work of Hon et al [13] also included succinct variants, which were implemented by Navarro and Valenzuela [19] on top of a compressed representation of D.…”
Section: Basic Conceptsmentioning
confidence: 99%