2012
DOI: 10.1145/2348832.2348837
|View full text |Cite
|
Sign up to set email alerts
|

Design and evaluation of decentralized online clustering

Abstract: Ensuring the efficient and robust operation of distributed computational infrastructures is critical, given that their scale and overall complexity is growing at an alarming rate and that their management is rapidly exceeding human capability. Clustering analysis can be used to find patterns and trends in system operational data, as well as highlight deviations from these patterns. Such analysis can be essential for verifying the correctness and efficiency of the operation of the system, as well as for discove… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 16 publications
(12 citation statements)
references
References 21 publications
0
12
0
Order By: Relevance
“…Thus, in this paper we adopt unsupervised learning, which does not require labelled training data and is used to find structures and patterns in data. For an extensive review on the these solutions we refer to [7] and, specifically on on-line clustering, to [8], [9]. Among the unsupervised solutions, we seek an algorithm that can process data fast, can handle mixed types, and ideally could process data in an online fashion.…”
Section: A Clustering As Unsupervised Machine Learningmentioning
confidence: 99%
“…Thus, in this paper we adopt unsupervised learning, which does not require labelled training data and is used to find structures and patterns in data. For an extensive review on the these solutions we refer to [7] and, specifically on on-line clustering, to [8], [9]. Among the unsupervised solutions, we seek an algorithm that can process data fast, can handle mixed types, and ideally could process data in an online fashion.…”
Section: A Clustering As Unsupervised Machine Learningmentioning
confidence: 99%
“…DOC, introduced by Quiroz et al [6,13], was created to provide online and decentralized data analysis, using the collective computing resources in distributed systems. DOC is an online algorithm, thereby allowing short-term system behavior to be captured, as opposed to an offline approach that cannot capture short-term behavior.…”
Section: Decentralized Online Clustering (Doc)mentioning
confidence: 99%
“…(5) The resulting meta-clusters can now be used to identify these related clusters as a single cluster to be tracked. (6) The trajectory for each temporal cluster is updated by storing features in sorted order according to the analysis window in which they were found.…”
Section: Cluster Tracking Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…The algorithm specification, along with details about its implementation and robustness to failures, were the subject of previous publications [47]. Other applications of DOC have been also studied in the context of autonomic resource provisioning [45,58] and autonomic policy adaptation [46] Here, we explain the main characteristics of the algorithm, and refer the reader to the cited publications for further details.…”
Section: Enterprise Business Data Analyticsmentioning
confidence: 99%