Sets

Bawa, Mayank; Manku, Gurmeet Singh; Raghavan, Prabhakar

doi:10.1145/860435.860491

Cited by 117 publications

(5 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In order to compare the search efficiency enabled by each of the approaches we define a cost metric. The total amount of work required to process a query against a collection is typically proportional to the number of documents that match the query 6 . The volume of index data transferred and processed for a query (the dominant cost) as well as the computational cost 4 http://trec.nist.gov/data/docs eng.html 5 http://www.lemurproject.org/indri/ 6 Every document that contains at least one of the query terms is considered as a matched document for the query.…”

Section: Methodsmentioning

confidence: 99%

“…Query routing is complicated further by the fact that peers are not naturally topically clustered [31]. Hence, clustering peers by content [14,20] is a possible solution as it has been shown to make query processing more efficient [6].…”

Section: Related Workmentioning

confidence: 99%

“…A topical cluster of cooperative peers is similar to a topical shard. However, no mechanism exists to dynamically determine the number of peer clusters to contact for a given query in this set-up, for example: Bawa et al [6] proposed a heuristic cutoff equal to a quarter of the number of topical clusters. Furthermore, clustering mechanism may fail to discover the structure of the document distribution and may not work well for unpopular topics.…”

Section: Related Workmentioning

confidence: 99%

See 2 more Smart Citations

Shard ranking and cutoff estimation for topically partitioned collections

Kulkarni

Tigelaar

Hiemstra

et al. 2012

Proceedings of the 21st ACM International Conference on Information and Knowledge Management

View full text Add to dashboard Cite

Large document collections can be partitioned into topical shards to facilitate distributed search [19]. In a low-resource search environment only a few of the shards can be searched in parallel. Such a search environment faces two intertwined challenges. First, determining which shards to consult for a given query: shard ranking. Second, how many shards to consult from the ranking: cutoff estimation. In this paper we present a family of three algorithms that address both of these problems. As a basis we employ a commonly used data structure, the central sample index (CSI) [29], to represent the shard contents. Running a query against the CSI yields a flat document ranking that each of our algorithms transforms into a tree structure. A bottom up traversal of the tree is used to infer a ranking of shards and also to estimate a stopping point in this ranking that yields cost-effective selective distributed search. As compared to a state-of-theart shard ranking approach the proposed algorithms provide substantially higher search efficiency while providing comparable search effectiveness.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Shard ranking and cutoff estimation for topically partitioned collections

Kulkarni

Tigelaar

Hiemstra

et al. 2012

Proceedings of the 21st ACM International Conference on Information and Knowledge Management

View full text Add to dashboard Cite

show abstract

“…In comparison, some other researchers take advantage of the different types of nodes' shared resources to improve the search efficiency in P2P networks by resource topic segment technology [16], the key idea is to arrange participating nodes in a topic-segmented overlay topology in which most connections are short-distance, connecting pairs of nodes with similar resources. Topically focused sets of nodes are then joined together into a single network by long-distance links.…”

Section: Node Resource Heterogeneitymentioning

confidence: 99%

Leveraging Heterogeneity Make Resource Location Efficient in Peer-to-Peer Networks

Gong¹,

Wen²,

Deng

et al. 2011

AMR

View full text Add to dashboard Cite

Although the original intent of the peer-to-peer (P2P) concept is to treat each participant equally, the heterogeneity is an inherent characteristic of P2P systems. In this paper, according to the previous researches in P2P resource sharing networks, we first conclude and classify the heterogeneity in P2P networks, and then analyze each type of P2P systems that utilize heterogeneity in P2P network. The above analysis and conclusion will become a good guidance to design new resource locating algorithm which takes advantage of heterogeneity in P2P networks.

show abstract

“…In order to improve the efficiency of information retrieval in P2P networks, there have been many researches on categorizing data items based on their similarities and clustering peers that have similar data items into the same category [1] - [8], [10] - [13]. These methods propagate query messages to all peers in a cluster.…”

Section: Introductionmentioning

confidence: 99%

A Dynamic Cluster Construction Method Based on Query Characteristics in Peer-to-Peer Networks

Kobayashi

Watanabe

Kanzaki

et al. 2009

2009 First International Conference on Advances in P2P Systems

View full text Add to dashboard Cite

To improve the efficiency of information retrieval in P2P networks, there have been many researches on categorizing data items and clustering peers. In almost all these researches, the number of categories and the policy of categorization are predetermined and static. However, users' requirements for information retrieval dynamically change. This leads to undesired increase of network traffic. In this paper, we propose a dynamic cluster construction method based on query characteristics and a search method using dynamic cluster. Our method dynamically constructs clusters, when the access frequencies for certain data items increase. This approach can reduce the number of query messages for searching data items further than static clustering methods.

show abstract

Sets

Cited by 117 publications

References 39 publications

Shard ranking and cutoff estimation for topically partitioned collections

Shard ranking and cutoff estimation for topically partitioned collections

Leveraging Heterogeneity Make Resource Location Efficient in Peer-to-Peer Networks

A Dynamic Cluster Construction Method Based on Query Characteristics in Peer-to-Peer Networks

Contact Info

Product

Resources

About