Proceedings of the IEEE/LEOS 3rd International Conference on Numerical Simulation of Semiconductor Optoelectronic Devices (IEEE
DOI: 10.1109/laweb.2003.1250291
|View full text |Cite
|
Sign up to set email alerts
|

Multi-tier architecture for Web search engines

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 24 publications
(23 citation statements)
references
References 5 publications
0
22
0
Order By: Relevance
“…It is common for search engines, in both, research and commercial systems, to partition large document collection into multiple tiers and shards (disjoint indexes) [4,3,17].…”
Section: Introductionmentioning
confidence: 99%
“…It is common for search engines, in both, research and commercial systems, to partition large document collection into multiple tiers and shards (disjoint indexes) [4,3,17].…”
Section: Introductionmentioning
confidence: 99%
“…Our work is different from the works in [4,17] as follows. The work in [5] reports only four sample query times, and the work in [4] does not include any performance evaluation, while we analyse the strategy for partitioning a collection by document, broken down overall performance in costs of critical phases of query execution and identified a set of design trade-offs over a distributed architecture.…”
Section: Related Workmentioning
confidence: 88%
“…This issue is outside the scope of our work, but it is one that deserves attention, particularly by the community of systems performance and operating systems. The Google and FAST search engine architectures are presented in [4,5,17]. In the first phase of query execution, index servers consult an inverted index and determine a set of relevant documents.…”
Section: Throughputmentioning
confidence: 99%
See 1 more Smart Citation
“…The first two factors can be easily included while computing scores as outlined above. The most commonly used way to integrate the other factors is to precompute a global importance score for each document, as done in PageRank [9], or a few importance scores for different topic groups [18], and to simply add these scores to the term-based scores during query execution [29,24,30]. Our approach does not depend on the ranking function as long as the total cost is dominated by the inverted list traversal.…”
Section: Term-based Rankingmentioning
confidence: 99%