2003
DOI: 10.1007/978-3-540-45172-3_19
|View full text |Cite
|
Sign up to set email alerts
|

On the Feasibility of Peer-to-Peer Web Indexing and Search

Abstract: This paper discusses the feasibility of peer-to-peer full-text keyword search of the Web. Two classes of keyword search techniques are in use or have been proposed: flooding of queries over an overlay network (as in Gnutella), and intersection of index lists stored in a distributed hash table. We present a simple feasibility analysis based on the resource constraints and search workload. Our study suggests that the peer-to-peer network does not have enough capacity to make naive use of either of search techniq… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
180
0
2

Year Published

2005
2005
2012
2012

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 203 publications
(182 citation statements)
references
References 15 publications
0
180
0
2
Order By: Relevance
“…A conceptually global but physically distributed directory, which is layered on top of Chord [33], holds compact, aggregated information about the peers' local indexes, to the extent that the individual peers are willing to disclose. Unlike [23], we use the Chord DHT to partition the term space, such that every peer is responsible for the statistics and metadata of a randomized subset of terms within the directory. For failure resilience and availability, the responsibility for a term can be replicated across multiple peers.…”
Section: Minerva Prototype For P2p Web Searchmentioning
confidence: 99%
“…A conceptually global but physically distributed directory, which is layered on top of Chord [33], holds compact, aggregated information about the peers' local indexes, to the extent that the individual peers are willing to disclose. Unlike [23], we use the Chord DHT to partition the term space, such that every peer is responsible for the statistics and metadata of a randomized subset of terms within the directory. For failure resilience and availability, the responsibility for a term can be replicated across multiple peers.…”
Section: Minerva Prototype For P2p Web Searchmentioning
confidence: 99%
“…It is not clear how to perform searches without sacrificing scalability or query completeness. Indeed, the obvious approaches to distributed full-text document search scale poorly [9].In this paper, however, we consider systems, such as file sharing, that search only over a relatively small amount of metadata associated with each file, but that have to support highly dynamic and unstable network topology, content, …”
mentioning
confidence: 99%
“…However, more research is needed for evaluating if the cost of distributed processing of relevance feedback queries is within reasonable bounds. [13] suggest that naive use of inverted files in large networks is beyond reasonable communication cost even when proceeding queries with few query terms.…”
Section: Adaptation To the Usermentioning
confidence: 99%