2005
DOI: 10.2298/csis0501103h
|View full text |Cite
|
Sign up to set email alerts
|

FP-outlier: Frequent pattern based outlier detection

Abstract: An outlier in a dataset is an observation or a point that is considerably dissimilar to or inconsistent with the remainder of the data. Detection of such outliers is important for many applications and has recently attracted much attention in the data mining research community. In this paper, we present a new method to detect outliers by discovering frequent patterns (or frequent itemsets) from the data set. The outliers are defined as the data transactions that contain less frequent patterns in their itemsets… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
109
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 145 publications
(109 citation statements)
references
References 1 publication
0
109
0
Order By: Relevance
“…Most existing categorical data oriented methods are based on a general assumption that anomalies lie in regions of low frequency (Akoglu et al, 2012;Ghoting, Otey, & Parthasarathy, 2004;He et al, 2005;Koufakou, Ortiz, Georgiopoulos, Anagnostopoulos, & Reynolds, 2007;Koufakou & Georgiopoulos, 2010;Smets & Vreeken, 2011;He, Deng, Xu, & Huang, 2006). Typical examples are frequent patterns based methods FPOF (He et al, 2005) and infrequent patterns based methods LOADED (Ghoting et al, 2004). FPOF and LOADED build a single model on the entire training set, and identify anomalies based on frequent patterns and infrequent patterns, respectively.…”
Section: Methods For Categorical Datamentioning
confidence: 99%
See 3 more Smart Citations
“…Most existing categorical data oriented methods are based on a general assumption that anomalies lie in regions of low frequency (Akoglu et al, 2012;Ghoting, Otey, & Parthasarathy, 2004;He et al, 2005;Koufakou, Ortiz, Georgiopoulos, Anagnostopoulos, & Reynolds, 2007;Koufakou & Georgiopoulos, 2010;Smets & Vreeken, 2011;He, Deng, Xu, & Huang, 2006). Typical examples are frequent patterns based methods FPOF (He et al, 2005) and infrequent patterns based methods LOADED (Ghoting et al, 2004). FPOF and LOADED build a single model on the entire training set, and identify anomalies based on frequent patterns and infrequent patterns, respectively.…”
Section: Methods For Categorical Datamentioning
confidence: 99%
“…We compared ZERO++ with FPOF (He et al, 2005), COMPREX (Akoglu et al, 2012), iForest (Liu et al, 2012) and LOF (Breunig et al, 2000). FPOF is a state-of-the-art frequency-based method for categorical data.…”
Section: Contenders and Their Parameter Settingsmentioning
confidence: 99%
See 2 more Smart Citations
“…There are a few density-based outlier detection methods, such as [9][10][11][12]. Our method is inherently different from those, since we do not target at outlier objects at all.…”
Section: Related Workmentioning
confidence: 99%