2011
DOI: 10.14778/3402755.3402756
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation strategies for top-k queries over memory-resident inverted indexes

Abstract: Top-k retrieval over main-memory inverted indexes is at the core of many modern applications: from large scale web search and advertising platforms, to text extenders and content management systems. In these systems, queries are evaluated using two major families of algorithms: document-at-a-time (DAAT) and term-at-a-time (TAAT). DAAT and TAAT algorithms have been studied extensively in the research literature, but mostly in disk-based settings. In this paper, we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
7
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 56 publications
(8 citation statements)
references
References 21 publications
1
7
0
Order By: Relevance
“…These approaches tend to be more efficient than are DaaT MaxScore and term-at-a-time (TaaT) approaches [67,68], particularly for short queries, the most common scenario in web search. However, for long queries or large candidate sets, the case is less clear-cut [23,33,54]; moreover, fusion over query variations often leads to very long queries.…”
Section: Efficient Index Traversalmentioning
confidence: 99%
“…These approaches tend to be more efficient than are DaaT MaxScore and term-at-a-time (TaaT) approaches [67,68], particularly for short queries, the most common scenario in web search. However, for long queries or large candidate sets, the case is less clear-cut [23,33,54]; moreover, fusion over query variations often leads to very long queries.…”
Section: Efficient Index Traversalmentioning
confidence: 99%
“…However, if a few n j corresponding to "head" labels are near N, which means almost all data points have the same label, the above cost reaches O(N 2 ). Efficient algorithms for top-k retrieval on an inverted index [11] or finding approximate k-nearest neighbors [12] can be applied to this situation. However, we focus on tail labels and ignore head labels in some cases.…”
Section: Learning To Partition Data Pointsmentioning
confidence: 99%
“…The interest of casting these updates as variants of MIPS problems is to exploit the ideas developed in the literature for solving these problems efficiently. Teflioudi and Gemulla (2016) and Fontoura et al (2011) give good overviews of MIPS solvers developed for recommender systems and information retrieval applications respectively. In both cases, the proposed methods rely on two main ideas: (i) adequate indexing techniques or data structures and (ii) pruning criteria which allow to not compute all inner products entirely.…”
Section: Updating the Working Setmentioning
confidence: 99%