1986
DOI: 10.1002/asi.4630370102
|View full text |Cite
|
Sign up to set email alerts
|

Using interdocument similarity information in document retrieval systems

Abstract: The first part of this paper reports a comparative study of the document classifications produced by the use of the single linkage, complete linkage, group average, and Ward clustering methods. Studies of cluster membership and of the effectiveness of cluster searches support previous findings that suggest that the single linkage classifications are rather different from those produced by the other three methods. These latter methods all produce large numbers of small clusters containing just pairs of document… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
20
0
1

Year Published

1991
1991
2006
2006

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 84 publications
(25 citation statements)
references
References 21 publications
3
20
0
1
Order By: Relevance
“…The minimum value of EX and the highest performance level for a random structure is EX = .79; a performance level that compares favorably to current cluster-based retrieval results from a variety of test collections using a variety of clustering criteria (Griffiths et al, 1986).…”
Section: Effectiveness Of Random Structures: Ex(dwts)mentioning
confidence: 87%
See 1 more Smart Citation
“…The minimum value of EX and the highest performance level for a random structure is EX = .79; a performance level that compares favorably to current cluster-based retrieval results from a variety of test collections using a variety of clustering criteria (Griffiths et al, 1986).…”
Section: Effectiveness Of Random Structures: Ex(dwts)mentioning
confidence: 87%
“…When E = 0, P = R = 1.0, and when E = 1, P = R = 0.0. The E measure has been selected because it provides the opportunity to compare results of the current investigation with existing clusterbased retrieval results (Croft, 1980;Van Rijsbergen, 1974a;Van Rijsbergen & Croft, 197.5;Willett, 1984), especially the comprehensive results provided by Griffiths, Luckhurst, & Willett (1986).…”
Section: Effectiveness Measuresmentioning
confidence: 99%
“…Indeed, there is a history of successful applications of the general nearest-neighbor approach (e.g., [9]). Within each iteration, Cluster-Audition scoring consists of two phases.…”
Section: Basic Methods For Scoring Renderersmentioning
confidence: 99%
“…Since this set is query-dependent, at least some of the clustering process must occur at retrieval time, mandating the use of extremely efficient algorithms [6,37]. The approach we adopt is to use overlapping nearest-neighbor clusters, which have formed the basis of effective retrieval algorithms in other work [12,17,19,33]: for each document d ∈ Dinit, we have the cluster {d} ∪ N bhd(d | k − 1, Dinit − {d}), where k is the cluster-size parameter.…”
Section: Graph Constructionmentioning
confidence: 99%