2017 IEEE International Conference on Data Mining Workshops (ICDMW) 2017
DOI: 10.1109/icdmw.2017.116
|View full text |Cite
|
Sign up to set email alerts
|

The Mean and Median Criteria for Kernel Bandwidth Selection for Support Vector Data Description

Abstract: Support vector data description (SVDD) is a popular technique for detecting anomalies. The SVDD classifier partitions the whole space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, and the Gaussian kernel is a common choice for the kernel function. The Gaussian kernel has a bandwidth parameter, whose value is important for good results. A s… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(19 citation statements)
references
References 10 publications
0
19
0
Order By: Relevance
“…We set the bandwidth of the kernel ( = 1∕(2s 2 ) ) according to the mean criterion proposed by Chaudhuri et al (2017) with p = 1:…”
Section: Methodsmentioning
confidence: 99%
“…We set the bandwidth of the kernel ( = 1∕(2s 2 ) ) according to the mean criterion proposed by Chaudhuri et al (2017) with p = 1:…”
Section: Methodsmentioning
confidence: 99%
“…To perform the bagging in S.B, we set the number K to 3. For the classifier, we use the Parzen Window Classifier as described in Chapelle (2005) using a Gaussian kernel and the mean-bandwidth heuristic as proposed in Chaudhuri et al (2017). The classifier is trained using a sliding window spanning the last 500 instances, i.e., w = 500 .…”
Section: Design Of Experimentsmentioning
confidence: 99%
“…[54] refer the median of the pairwise Euclidean distances as a common choice. However, [15] demonstrate that the use of the average and the median of Euclidean distances to estimate σ produce similar clustering results for the majority of situations. They justify the use of the average distances by its simplicity and fast computation even when the dataset is large.…”
Section: Choice Of Kernel: the Radial Basis Function Kernelmentioning
confidence: 79%