Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval 2003
DOI: 10.1145/860435.860491
|View full text |Cite
|
Sign up to set email alerts
|

Sets

Abstract: We present SETS, an architecture for efficient search in peer-to-peer networks, building upon ideas drawn from machine learning and social network theory. The key idea is to arrange participating sites in a topic-segmented overlay topology in which most connections are short-distance, connecting pairs of sites with similar content. Topically focused sets of sites are then joined together into a single network by long-distance links. Queries are matched and routed to only the topically closest regions. We discu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2005
2005
2012
2012

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 117 publications
(5 citation statements)
references
References 39 publications
0
4
0
Order By: Relevance
“…In order to compare the search efficiency enabled by each of the approaches we define a cost metric. The total amount of work required to process a query against a collection is typically proportional to the number of documents that match the query 6 . The volume of index data transferred and processed for a query (the dominant cost) as well as the computational cost 4 http://trec.nist.gov/data/docs eng.html 5 http://www.lemurproject.org/indri/ 6 Every document that contains at least one of the query terms is considered as a matched document for the query.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…In order to compare the search efficiency enabled by each of the approaches we define a cost metric. The total amount of work required to process a query against a collection is typically proportional to the number of documents that match the query 6 . The volume of index data transferred and processed for a query (the dominant cost) as well as the computational cost 4 http://trec.nist.gov/data/docs eng.html 5 http://www.lemurproject.org/indri/ 6 Every document that contains at least one of the query terms is considered as a matched document for the query.…”
Section: Methodsmentioning
confidence: 99%
“…Query routing is complicated further by the fact that peers are not naturally topically clustered [31]. Hence, clustering peers by content [14,20] is a possible solution as it has been shown to make query processing more efficient [6].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In comparison, some other researchers take advantage of the different types of nodes' shared resources to improve the search efficiency in P2P networks by resource topic segment technology [16], the key idea is to arrange participating nodes in a topic-segmented overlay topology in which most connections are short-distance, connecting pairs of nodes with similar resources. Topically focused sets of nodes are then joined together into a single network by long-distance links.…”
Section: Node Resource Heterogeneitymentioning
confidence: 99%
“…In order to improve the efficiency of information retrieval in P2P networks, there have been many researches on categorizing data items based on their similarities and clustering peers that have similar data items into the same category [1] - [8], [10] - [13]. These methods propagate query messages to all peers in a cluster.…”
Section: Introductionmentioning
confidence: 99%