2013
DOI: 10.1613/jair.3623
|View full text |Cite
|
Sign up to set email alerts
|

Toward Supervised Anomaly Detection

Abstract: Anomaly detection is being regarded as an unsupervised learning task as anomalies stem from adversarial or unlikely events with unknown distributions. However, the predictive performance of purely unsupervised anomaly detection often fails to match the required detection rates in many tasks and there exists a need for labeled data to guide the model generation. Our first contribution shows that classical semi-supervised approaches, originating from a supervised classifier, are inappropriate and hardly detect n… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
184
1
3

Year Published

2014
2014
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 292 publications
(200 citation statements)
references
References 53 publications
(49 reference statements)
2
184
1
3
Order By: Relevance
“…For Fraction of all connections of all clients that specified HTTP header field Content-Type as any text variant the detection of SQL-injection, cross-site-scripting (XSS), and PHP file-inclusion (L/RFI), traffic can be modeled based on HTTP header and query string information using HMMs (Ariu et al 2011), n-gram models (Wressnegger et al 2013), general kernels (Düssel et al 2008), or other models (Robertson and Maggi 2010). Anomaly-detection mechanisms were investigated, from centroid anomaly-detection models (Kloft and Laskov 2012) to setting hard thresholds on the likelihood of new HTTP requests given the model, to unsupervised learning of support-vector data description (SVDD) models (Düssel et al 2008, Görnitz et al 2013.…”
Section: Discussion and Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…For Fraction of all connections of all clients that specified HTTP header field Content-Type as any text variant the detection of SQL-injection, cross-site-scripting (XSS), and PHP file-inclusion (L/RFI), traffic can be modeled based on HTTP header and query string information using HMMs (Ariu et al 2011), n-gram models (Wressnegger et al 2013), general kernels (Düssel et al 2008), or other models (Robertson and Maggi 2010). Anomaly-detection mechanisms were investigated, from centroid anomaly-detection models (Kloft and Laskov 2012) to setting hard thresholds on the likelihood of new HTTP requests given the model, to unsupervised learning of support-vector data description (SVDD) models (Düssel et al 2008, Görnitz et al 2013.…”
Section: Discussion and Related Workmentioning
confidence: 99%
“…A binary SVM trained on labeled data has been observed to consistently outperform a one-class SVM using n-gram features (Wressnegger et al 2013). Similarly, augmenting SVDDs with labeled data has been observed to greatly improve detection accuracy (Görnitz et al 2013). Other work has studied SVMs (Khan et al 2007;Li et al 2012) and other classification methods (Koc et al 2012;Peddabachigari et al 2007;Gharibian and Ghorbani 2007).…”
Section: Discussion and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…We use a probabilistic anomaly detection method that can benefit from anomalous examples for the authorship verification process based on a multivariate Gaussian modelling. Given the fact that unsupervised anomaly detection approaches often fail to match the required detection rates in many tasks and there exists a need for labelled data to guide the model generation [7], our proposed methods is weakly supervised in the sense that it takes into consideration a small amount of representative anomalous data for the model generation.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Some examples of these methods are Support Vector Machines (SVMs) and Neural Networks (Erfani, Baktashmotlagh, Rajasegarar, Karunasekera, & Leckie, 2015). Semi-supervised methods require labelled instances of the normal class only, in order to train their detection models, e.g., one-class classifiers (Görnitz, Kloft, Rieck, & Brefeld, 2013). Compared to supervised methods and semi-supervised methods, unsupervised methods, which do not require labelled instances, are more widely used in industry, because obtaining accurate labelled data for anomaly detection often has a very high cost (Chandola et al, 2009).…”
Section: Related Workmentioning
confidence: 99%