2014
DOI: 10.1007/978-3-319-11918-2_22
|View full text |Cite
|
Sign up to set email alerts
|

Performance Improvements for Search Systems Using an Integrated Cache of Lists+Intersections

Abstract: Modern information retrieval systems use sophisticated techniques for e ciency and scalability purposes. Among the most frequent such techniques is the implementation of several levels of caching. The main goal of a cache is to speedup computation by exploiting frequent, recent or costly data used in the past. In this study we propose and evaluate a static cache that works simultaneously as list and intersection cache, o↵ering a more e cient way of handling cache space. In addition, we propose e↵ective strateg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 19 publications
0
6
0
Order By: Relevance
“…For query logs simulated using the method in Section 4, we cannot control repeated queries. Experimental settings We experiment with cache sizes of 200, 600 and 1000 MB (cache sizes can vary between 100 MB to 16 GB [14]) and fix the size of a posting to 8 bytes. We use ClueWeb09 cat.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…For query logs simulated using the method in Section 4, we cannot control repeated queries. Experimental settings We experiment with cache sizes of 200, 600 and 1000 MB (cache sizes can vary between 100 MB to 16 GB [14]) and fix the size of a posting to 8 bytes. We use ClueWeb09 cat.…”
Section: Methodsmentioning
confidence: 99%
“…the frequency of t in each document [6]. Posting list caching can reduce the amount of disk I/O involved [13,14] in query processing, affords higher cache utilisation and hit rates than result caching [12], and can combine terms to answer incoming queries. Our contribution: We show that static caching of posting lists can be modelled in a principled manner using constrained combinatorial optimisation (CCO), a standard method that has yielded great improvements in many fields [9,Chap. 35], and we provide a principled investigation of whether CCO would yield better solutions (preferably using modest extra computational resources) than greedy methods.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Li, Lee, Sivasubramaniam, and Giles (2007) propose a hybrid architecture involving result, posting list, and document caches. Tolosa, Becchetti, Feuerstein, and Marchetti-Spaccalema (2014) present a technique that couples posting lists and term intersection lists in a single static cache. Zhou, Li, Dong, Xu, and Xiao (2015) also propose a three-level cache hierarchy consisting of a result cache, posting list cache, and a term intersection cache, generated using the top-N frequent item sets (terms) mined from query logs.…”
Section: Related Workmentioning
confidence: 99%