Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval 2010
DOI: 10.1145/1835449.1835466
|View full text |Cite
|
Sign up to set email alerts
|

Caching search engine results over incremental indices

Abstract: A Web search engine must update its index periodically to incorporate changes to the Web. We argue in this paper that index updates fundamentally impact the design of search engine result caches, a performance-critical component of modern search engines. Index updates lead to the problem of cache invalidation: invalidating cached entries of queries whose results have changed. Naïve approaches, such as flushing the entire cache upon every index update, lead to poor performance and in fact, render caching futile… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
75
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(77 citation statements)
references
References 19 publications
2
75
0
Order By: Relevance
“…Sazoglu et al [2013b] investigate hybrid strategies for setting TTL values. Other works [Alicia et al 2011;Blanco et al 2010aBlanco et al , 2010bBortnikov et al 2011; propose more sophisticated mechanisms. Herein, we do not further discuss these studies, since the cache freshness issue is not in the scope of our work.…”
Section: Search Results Cachingmentioning
confidence: 99%
“…Sazoglu et al [2013b] investigate hybrid strategies for setting TTL values. Other works [Alicia et al 2011;Blanco et al 2010aBlanco et al , 2010bBortnikov et al 2011; propose more sophisticated mechanisms. Herein, we do not further discuss these studies, since the cache freshness issue is not in the scope of our work.…”
Section: Search Results Cachingmentioning
confidence: 99%
“…Caching documents for popular web queries is widely employed in practice. Blanco et al [5] propose a scheme to invalidate cached results when new documents arrive. Invalidation causes re-execution of the query when it is invoked by a user.…”
Section: Real-time Tweet Indexingmentioning
confidence: 99%
“…Serving costs comparison to the pub-sub architecture. We continue by further comparing our approach with a caching scheme, along the lines proposed by Blanco et al [5], which caches the top-k updates that annotate each story, and invalidates the cached list on arrival of a tweet that could potentially annotate the story. We consider the cost of annotating the 1K most-popular stories shown on a news website in a single day, where each story is represented using its main body of text.…”
Section: Real-time Tweet Indexingmentioning
confidence: 99%
“…The result sets for such queries should have a short time to live, whereas there are queries for which the search results change very rarely, they could be cached much longer. Making informed decisions about invalidation requires knowledge about the rate of change for particular queries [19]. Perhaps this information, or an estimate thereof, could be made an integral part of the search results, similar to the way in which Domain Name System (DNS) records work.…”
Section: We Have Explored Several Fundamental Caching Policies Showinmentioning
confidence: 99%