2019
DOI: 10.1111/exsy.12473
|View full text |Cite
|
Sign up to set email alerts
|

A method for outlier detection based on cluster analysis and visual expert criteria

Abstract: Outlier detection is an important problem occurring in a wide range of areas. Outliers are the outcome of fraudulent behaviour, mechanical faults, human error, or simply natural deviations. Many data mining applications perform outlier detection, often as a preliminary step in order to filter out outliers and build more representative models. In this paper, we propose an outlier detection method based on a clustering process. The aim behind the proposal outlined in this paper is to overcome the specificity of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 46 publications
(60 reference statements)
0
5
0
Order By: Relevance
“…K-means algorithm has the advantages of relatively simple principle, easy implementation, high convergence speed and strong interpretability [9] . The basic idea of this paper is to divide the sample points into several clusters according to the distance between the sample points for a given sample data set, so that the distance between the sample points in the cluster is as close as possible, and the distance between the sample points in the cluster is as far as possible.…”
Section: K-means Clustering Algorithmmentioning
confidence: 99%
“…K-means algorithm has the advantages of relatively simple principle, easy implementation, high convergence speed and strong interpretability [9] . The basic idea of this paper is to divide the sample points into several clusters according to the distance between the sample points for a given sample data set, so that the distance between the sample points in the cluster is as close as possible, and the distance between the sample points in the cluster is as far as possible.…”
Section: K-means Clustering Algorithmmentioning
confidence: 99%
“…The Clustering-based method used in this study creates clusters of the outlier scores using hierarchical clustering, classifying objects within clusters as "normal" and objects outside as "outliers" [42].…”
Section: Thresholdingmentioning
confidence: 99%
“…Approaches. In the clustering-based outlier detection approaches [36,37], it is necessary to define and calculate the distance or similarity metric between two data instances; then, based on the metric, the data instances that are far away from their closest cluster centroid or where their density is below a threshold are declared as outliers. e k-means is one of the most wellknown clustering-based algorithms; it has been widely used in outlier detection since its simplicity and efficiency.…”
Section: Clustering-based Outlier Detectionmentioning
confidence: 99%