Proceedings of the Twelfth International Conference on Information and Knowledge Management 2003
DOI: 10.1145/956863.956945

Operational requirements for scalable search systems

Abstract: Prior research into search system scalability has primarily addressed query processing efficiency [1, 2, 3] or indexing efficiency [3], or has presented some arbitrary system architecture [4]. Little work has introduced any formal theoretical framework for evaluating architectures with regard to specific operational requirements, or for comparing architectures beyond simple timings [5] or basic simulations [6,7]. In this paper, we present a framework based upon queuing network theory for analyzing search syste…

Cited by 28 publications (11 citation statements)
References 36 publications
“…Such algorithms could improve retrieval effectiveness by assisting in query disambiguation, making it easier to determine what information need is being expressed by a query at a given time. They could also assist research in search efficiency that takes into account query arrival-rates [3].…”
mentioning
confidence: 99%
“…As a consequence of these tight performance constraints, search engines construct and maintain very large compute infrastructures for crawling the Web, indexing discovered pages, and processing user queries [3]. The scalability and efficiency of these infrastructures require careful performance optimizations in every major component of the search engine [2].…”
Section: Description
mentioning
confidence: 99%
“…The number of cluster replicas is determined based on a constraint on the peak sustainable throughput. In addition, the utilization of search nodes is taken into account (Chowdhury & Pass, 2003).…”
Section: Query Processing Overview
mentioning
confidence: 99%