2006
DOI: 10.1007/11687238_12

IQN Routing: Integrating Quality and Novelty in P2P Querying and Ranking

Abstract: We consider a collaboration of peers autonomously crawling the Web. A pivotal issue when designing a peer-to-peer (P2P) Web search engine in this environment is query routing: selecting a small subset of (a potentially very large number of relevant) peers to contact to satisfy a keyword query. Existing approaches for query routing work well on disjoint data sets. However, naturally, the peers' data collections often highly overlap, as popular documents are highly crawled. Techniques for estimating th…

citations
Cited by 25 publications
(26 citation statements)
references
References 40 publications
(48 reference statements)
“…Overlap estimation among data sources has been used in a number of approaches in the area of distributed and P2P information retrieval [4,10,15,18,23,24]. COSCO [10] gathers statistics about coverage and overlap from past queries and uses them to determine in which order the overlapping collections should be accessed to retrieve the most new results in the least number of collections.…”
Section: Related Work
type: mentioning
confidence: 99%
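
The ordering idea in the statement above can be illustrated with a greedy, overlap-aware selection. This is only a minimal sketch and not COSCO's actual method: it assumes the full result set of each collection is known in advance (COSCO instead works from statistics gathered on past queries), and the function name `order_by_novelty` and the sample data are hypothetical.

```python
from typing import Dict, List, Set


def order_by_novelty(collections: Dict[str, Set[str]], limit: int) -> List[str]:
    """Greedily order collections so each pick adds the most unseen results.

    Assumption: exact result sets are available. A real system would use
    coverage/overlap estimates instead.
    """
    seen: Set[str] = set()
    order: List[str] = []
    remaining = dict(collections)
    while remaining and len(order) < limit:
        # Novelty of a collection = results not covered by earlier picks.
        # Iterating over sorted names makes tie-breaking deterministic.
        best = max(sorted(remaining), key=lambda name: len(remaining[name] - seen))
        if not remaining[best] - seen:
            break  # every remaining collection is fully redundant
        order.append(best)
        seen |= remaining.pop(best)
    return order


sample = {
    "A": {"d1", "d2", "d3", "d4"},
    "B": {"d3", "d4", "d5"},
    "C": {"d1", "d2"},
}
ranking = order_by_novelty(sample, limit=3)
```

In this sample, collection C is never contacted because everything it holds is already covered by A, which is the effect an overlap-aware ordering aims for.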
“…MIPs have been shown to provide a good tradeoff between estimation error and space requirements [15,6]. In addition, MIPs of different lengths can be compared, which can be beneficial for datasets of different sizes.…”
Section: Related Work
type: mentioning
confidence: 99%
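
As a rough illustration of how min-wise independent permutations (MIPs) can estimate the resemblance of two document-ID sets, the sketch below approximates random permutations with linear hash functions modulo a prime. The function names, the prime, and the seed are assumptions for illustration, not taken from the cited papers. Vectors of different lengths are compared on their common prefix, assuming both were built from the same sequence of permutations.

```python
import random
from typing import Iterable, List, Sequence, Tuple

PRIME = 2_147_483_647  # 2^31 - 1; document IDs are assumed to lie in [0, PRIME)


def make_permutations(n: int, seed: int = 0) -> List[Tuple[int, int]]:
    """Draw n (a, b) pairs defining the hashes h(x) = (a*x + b) mod PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME)) for _ in range(n)]


def mips_vector(doc_ids: Iterable[int], perms: Sequence[Tuple[int, int]]) -> List[int]:
    """Keep, per permutation, the minimum hashed ID (set must be non-empty)."""
    ids = list(doc_ids)
    return [min((a * x + b) % PRIME for x in ids) for a, b in perms]


def estimate_resemblance(v1: Sequence[int], v2: Sequence[int]) -> float:
    """Fraction of matching minima over the shared prefix of both vectors."""
    n = min(len(v1), len(v2))
    matches = sum(1 for i in range(n) if v1[i] == v2[i])
    return matches / n


perms = make_permutations(64, seed=1)
long_vec = mips_vector(range(100), perms)        # 64 permutations
short_vec = mips_vector(range(100), perms[:32])  # only the first 32
shifted = mips_vector(range(50, 150), perms)     # overlaps half of range(100)
estimate = estimate_resemblance(long_vec, shifted)
```

The estimate approximates the Jaccard resemblance |A ∩ B| / |A ∪ B|; using more permutations reduces the estimation error at the cost of a longer vector.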
“…(Cuenca-Acuna et al., 2003; Reynolds and Vahdat, 2003; Suel et al., 2003; Zhang and Suel, 2005; Michel et al., 2005a, 2006) When using a distributed global index, a multiterm query requires multiple lookups in the distributed hash table. The posting lists for all terms need to be intersected to find the documents that contain all query terms.…”
Section: Approximate Intersection Of Posting Lists With Bloom Filters
type: mentioning
confidence: 99%
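
The section title of this statement refers to intersecting posting lists approximately with Bloom filters. A minimal sketch of that general idea follows, under stated assumptions: the class `BloomFilter`, its parameters, and the SHA-256-based hashing are illustrative choices and not the design of any cited system. One peer summarizes its posting list in a filter, and the other peer keeps only the documents that pass the filter, so the result may contain false positives but never misses a true match.

```python
import hashlib
from typing import Iterable, Iterator, List


class BloomFilter:
    """Tiny Bloom filter backed by a Python int used as a bit array."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, item: str) -> Iterator[int]:
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item: str) -> bool:
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def approximate_intersection(list_a: Iterable[str], list_b: Iterable[str]) -> List[str]:
    """Return the entries of list_b that pass a Bloom filter built from list_a."""
    summary = BloomFilter()
    for doc_id in list_a:
        summary.add(doc_id)
    # Only the compact filter would need to cross the network, not list_a.
    return [doc_id for doc_id in list_b if doc_id in summary]


postings_term1 = [f"doc{i}" for i in range(0, 40)]
postings_term2 = [f"doc{i}" for i in range(30, 70)]
candidates = approximate_intersection(postings_term1, postings_term2)
```

Because of possible false positives, `candidates` is a superset of the exact intersection; a follow-up check against the first posting list would be needed to remove them.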
“…Many proposals have been made in the literature, for example: globally available term statistics about the peers' contents [8,16,27,2], epidemic routing using gossiping strategies [18], routing indices with peer summaries from local neighborhoods [10,21], statistical synopses such as Bloom filters or hash sketches maintained in a directory based on distributed hash tables (DHT) [5,28,32], randomized expander graphs with low-diameter guarantees [25,26] and randomized rendezvous [31], clustering of thematically related peers [11,12,23], superpeer-based hierarchical networks [24,22], cost/benefit optimization based on coarse-grained global knowledge [29,30], and many more.…”
Section: Motivation
type: mentioning
confidence: 99%