2020
DOI: 10.1007/s10994-020-05905-4
|View full text |Cite
|
Sign up to set email alerts
|

Statistical hierarchical clustering algorithm for outlier detection in evolving data streams

Abstract: Anomaly detection is a hard data analysis process that requires constant creation and improvement of data analysis algorithms. Using traditional clustering algorithms to analyse data streams is impossible due to processing power and memory issues. To solve this, the traditional clustering algorithm complexity needed to be reduced, which led to the creation of sequential clustering algorithms. The usual approach is two-phase clustering, which uses online phase to relax data details and complexity, and offline p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(12 citation statements)
references
References 35 publications
0
12
0
Order By: Relevance
“…In this work, agglomerative hierarchical clustering algorithm [ 37 ] is used as the feature grouping method. As shown in Figure 4 , the agglomerative hierarchical clustering method initially treats each feature as a cluster, and then combines the two most similar clusters into a new larger cluster step by step.…”
Section: Proposed Fault Detection Methodsmentioning
confidence: 99%
“…In this work, agglomerative hierarchical clustering algorithm [ 37 ] is used as the feature grouping method. As shown in Figure 4 , the agglomerative hierarchical clustering method initially treats each feature as a cluster, and then combines the two most similar clusters into a new larger cluster step by step.…”
Section: Proposed Fault Detection Methodsmentioning
confidence: 99%
“…Intra-cluster uniformity can be further improved by lowering this threshold. While rigorous methods for removal of outliers exist in the literature (Almeida et al, 2007;Fan et al, 2013;Krleža et al, 2021), we employed this basic outlier detection algorithm as a proof of concept, one that is easy to understand and can be readily applied.…”
Section: Analysis Of Outliersmentioning
confidence: 99%
“…Note that the locations in remote regions can potentially form small clusters or even become isolated from other locations in hierarchical clustering due to the bottom-up nature [29]. In real business, outliers are not allowed to appear.…”
Section: Algorithm 1 Distance Evaluation Considering Only Mandatory C...mentioning
confidence: 99%