2021
DOI: 10.5755/j01.itc.50.1.25588
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Density-based Technique for Outlier Detection of High Dimensional Data Utilizing Full Feature Space

Abstract: Recently, anomaly detection has acquired a realistic response from data mining scientists as a graph of its reputation has increased smoothly in various practical domains like product marketing, fraud detection, medical diagnosis, fault detection and so many other fields. High dimensional data subjected to outlier detection poses exceptional challenges for data mining experts and it is because of natural problems of the curse of dimensionality and resemblance of distant and adjoining points. Traditional algori… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 40 publications
(43 reference statements)
0
5
0
Order By: Relevance
“…Many previous OD studies [3,13,28,35,36] have not considered the appearing frequency of each data sample in the detection process, thus, the detected outliers could not fit their two characteristics well. Different with these OD methods, the appearing frequency of patterns is also considered as an important factor that will influence the detection accuracy in pattern matching-based method [8], thus, its detection accuracy is much higher.…”
Section: Detecting Outliers In Data Streamsmentioning
confidence: 99%
See 2 more Smart Citations
“…Many previous OD studies [3,13,28,35,36] have not considered the appearing frequency of each data sample in the detection process, thus, the detected outliers could not fit their two characteristics well. Different with these OD methods, the appearing frequency of patterns is also considered as an important factor that will influence the detection accuracy in pattern matching-based method [8], thus, its detection accuracy is much higher.…”
Section: Detecting Outliers In Data Streamsmentioning
confidence: 99%
“…The use of anti-monotonic constraint property can help to reduce extensible patterns, and thus saving the time cost. Many previous OD studies [3,13,28,35,36] have not considered the appearing frequency of each data sample in the detection process, thus, the detected outliers could not fit their two characteristics well. Different with these OD methods, the appearing frequency of patterns is also considered as an important factor that will traditional pattern matching-based OD is very long because of the large amount of RPs can be mined.…”
Section: Anti-monotone Constraint Propertymentioning
confidence: 99%
See 1 more Smart Citation
“…The o represents another point o in the matrix, Nk(p) represents the set of k nearest neighbors of p,and lrdk(p) represents the locally reachable density at point p. kdistance(o) denotes the location of the kth distance from o, excluding the point o. d(o,p) means the distance between the point o and the point p. The formula is ( 17): (17) The larger the LOFk(p) value, the more likely it is to be an outlier. Finally, the outlier points can be derived by performing a descending sorting operation on the local outlier factors of each object in matrix Z and comparing the magnitude with a custom threshold.…”
Section: Search Outliermentioning
confidence: 99%
“…Outlier detection has an extensive range of applications in many fields: bank fraud [2]- [4], video surveillance [5]- [8], network anomalies [9]- [11], finding new celestial objects [12], [13], etc. The available outlier detection algorithms can be broadly classified into: distance-based algorithms [14]- [17], density-based algorithms [18]- [20], clustering-based algorithms [21], [22], statistical methods [23], integration-based methods [24], numerous neural network-based algorithms [25] and graph-based algorithms [26] etc. Most graph algorithms only work on graph-structured data, and the same goes for GCN [27].…”
Section: Introductionmentioning
confidence: 99%