2021
DOI: 10.1016/j.knosys.2021.106830
|View full text |Cite
|
Sign up to set email alerts
|

Archetype analysis: A new subspace outlier detection approach

Abstract: The problem of detecting outliers in multivariate data sets with continuous numerical features is addressed by a new method. This method combines projections into relevant subspaces by archetype analysis with a nearest neighbor algorithm, through an appropriate ensemble of the results. Our method is able to detect an anomaly in a simple data set with a linear correlation of two features, while other methods fail to recognize that anomaly. Our method performs among top in an extensive comparison with 23 state-o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 69 publications
(112 reference statements)
0
3
0
Order By: Relevance
“…Keller et al [29] propose determining high-contrast subspaces for outlier detection as a form of data pre-processing. Cabero et al [30] also determine the subspaces as a data pre-processing step based on archetypal analysis followed by a k th NN approach. Some authors combine distance-based outlier detection with dimensionality reduction techniques such as principal component analysis [31] for high-dimensional data.…”
Section: Nearest Neighborsmentioning
confidence: 99%
“…Keller et al [29] propose determining high-contrast subspaces for outlier detection as a form of data pre-processing. Cabero et al [30] also determine the subspaces as a data pre-processing step based on archetypal analysis followed by a k th NN approach. Some authors combine distance-based outlier detection with dimensionality reduction techniques such as principal component analysis [31] for high-dimensional data.…”
Section: Nearest Neighborsmentioning
confidence: 99%
“…Nguyen et al (2013) propose to find subspaces with strong mutual correlations and then identify anomalies on these subspaces. Finally, Cabero et al (2021) use archetype analysis to project the feature space into various subspaces with linear correlations based on nearest neighbours. On this basis, they explore outliers by ensembling the results obtained on relevant subspaces.…”
Section: Subspace-based Anomaly Detectionmentioning
confidence: 99%
“…As a limitation, in the case of big data, the computation could be slow for archetypal-based methods. In that case, we could use more efficient algorithms, as explained in [39] (e.g., [37,[40][41][42][43]).…”
Section: Suitability Of Cf Tools In Classification Problems With Uncertaintiesmentioning
confidence: 99%