2008
DOI: 10.1002/cplx.20238

On the parallel complexity of hierarchical clustering and CC‐complete problems

Abstract: Complex data sets are often unmanageable unless they can be subdivided and simplified in an intelligent manner. Clustering is a technique that is used in data mining

Cited by 11 publications (15 citation statements)
References 29 publications
“…A prescribed flipping sequence is an ordering of edges in which each succeeding edge's labels may be flipped if and only if neither of its labels has already been flipped. This problem is NC-equivalent to the Lexicographically First Maximal Matching Problem, and so CC-complete; see [10] for a list of CC-complete problems.…”
Section: Results
mentioning
confidence: 99%
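
The Lexicographically First Maximal Matching Problem named in the statement above can be illustrated with a short sequential sketch: scan the edges in their prescribed order and accept an edge only if neither endpoint has already been used. The function name and the edge-list representation are assumptions made for this illustration; they do not come from the cited paper, and the sketch says nothing about the parallel complexity results themselves.

```python
def lex_first_maximal_matching(edges):
    """Greedy scan: take each edge in the given order if both endpoints are free.

    `edges` is an ordered list of (u, v) pairs; the order plays the role of
    the prescribed sequence. Illustrative sketch only.
    """
    used = set()       # vertices already covered by an accepted edge
    matching = []      # accepted edges, in the order they were taken
    for u, v in edges:
        if u != v and u not in used and v not in used:
            matching.append((u, v))
            used.update((u, v))
    return matching


if __name__ == "__main__":
    # A 4-cycle listed in order: the first edge blocks the second and fourth.
    print(lex_first_maximal_matching([(1, 2), (2, 3), (3, 4), (4, 1)]))
```

Each decision depends on all earlier decisions in the sequence, which is why the ordering is part of the problem's input.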
“…A variety of techniques have been developed for proving lower bounds on complexity of clustering [2,22,3]. When we run our Hadoop cluster on Amazon Elastic MapReduce, we can easily expand or shrink the number of virtual servers in our cluster depending on our processing needs.…”
Section: Introduction
mentioning
confidence: 99%
“…on the actual demand or the relationship between the data in the dataset. For a covering with fewer sample points, the single linkage method (using the Euclidean distance) in the hierarchical clustering algorithm [24,25] is adopted to merge them to form an ellipsoidal domain, which means combining the most similar pair of clusters into a new cluster. Then, the similarities between the new cluster and the other clusters are updated, and the two most similar clusters are again merged.…”
mentioning
confidence: 99%
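
The merge loop described in the statement above (join the most similar pair of clusters, update similarities, repeat) can be sketched as a naive single-linkage procedure with Euclidean distance. This is a minimal illustration under stated assumptions: the function name, the tuple-based point representation, and the `num_clusters` stopping rule are hypothetical and are not taken from the citing paper, which does not give its stopping criterion in the quoted passage.

```python
import math


def single_linkage(points, num_clusters):
    """Naive agglomerative clustering with single linkage (illustrative only).

    `points` is a list of coordinate tuples; merging stops once
    `num_clusters` clusters remain (an assumed stopping rule).
    """
    if num_clusters < 1:
        raise ValueError("num_clusters must be at least 1")

    clusters = [[p] for p in points]  # start with one cluster per point

    def linkage(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(p, q) for p in a for q in b)

    while len(clusters) > num_clusters:
        best = None  # (distance, i, j) of the closest pair found so far
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = linkage(clusters[i], clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]  # merge the closest pair
        del clusters[j]
    return clusters


if __name__ == "__main__":
    print(single_linkage([(0, 0), (0, 1), (5, 5), (5, 6)], 2))
```

Recomputing every pairwise linkage on each pass keeps the sketch short at the cost of efficiency; practical implementations maintain a distance matrix and update it after each merge.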