2004
DOI: 10.1023/b:supe.0000040611.25862.d9
|View full text |Cite
|
Sign up to set email alerts
|

A Parallel Hybrid Web Document Clustering Algorithm and its Performance Study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0
3

Year Published

2007
2007
2019
2019

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 31 publications
(17 citation statements)
references
References 17 publications
0
14
0
3
Order By: Relevance
“…In agglomerative algorithms, each document is initially assigned to a different cluster. The algorithm then repeatedly merges pairs of clusters until a certain stopping criterion is met [51]. Conversely, divisive algorithms repeatedly divide the whole documents into a certain number of clusters, increasing the number of clusters at each step.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In agglomerative algorithms, each document is initially assigned to a different cluster. The algorithm then repeatedly merges pairs of clusters until a certain stopping criterion is met [51]. Conversely, divisive algorithms repeatedly divide the whole documents into a certain number of clusters, increasing the number of clusters at each step.…”
Section: Introductionmentioning
confidence: 99%
“…Hierarchical clustering algorithms [22,28,38,52] create a hierarchical decomposition of the given dataset which forms dendrograma tree by splitting the dataset recursively into smaller subsets, representing the documents in a multi-level structure [14,21]. The hierarchical algorithms can be further divided into either agglomerative or divisive algorithms [51]. In agglomerative algorithms, each document is initially assigned to a different cluster.…”
Section: Introductionmentioning
confidence: 99%
“…The k-means algorithm [10] (with its many variants) is a popular clustering method for text and web collections [17,18]. It gained its popularity due to its simplicity and intuition.…”
Section: Clusteringmentioning
confidence: 99%
“…This idea was described in [31] and also resolves the indeterminacy in k-means but is fundamentally different from our proposed k-means steered PDDP variants.…”
Section: An Ordering Of Its Columns Then the Optimal Cut-point For 2mentioning
confidence: 99%