2011
DOI: 10.3233/ida-2010-0466
|View full text |Cite
|
Sign up to set email alerts
|

Exploring discrepancies in findings obtained with the KDD Cup '99 data set

Abstract: The KDD Cup '99 data set has been widely used to evaluate intrusion detection prototypes, most based on machine learning techniques, for nearly a decade. The data set served well in the KDD Cup '99 competition to demonstrate that machine learning can be useful in intrusion detection systems. However, there are discrepancies in the findings reported in the literature. Further, some researchers have published criticisms of the data (and the DARPA data from which the KDD Cup '99 data has been derived), questionin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 41 publications
(12 citation statements)
references
References 65 publications
0
12
0
Order By: Relevance
“…Eventually, the proposed model will be employed for classification of new samples. For evaluation of the reliability of the suggested approach, a number of experiments are conducted using KDD Cup 1999 data which is known to be a popular dataset and has been widely used by the researchers [39][40][41][42]. The results of experiments and their investigations are described in the subsequent part.…”
Section: Feature Selection Methods Ga-svmmentioning
confidence: 99%
“…Eventually, the proposed model will be employed for classification of new samples. For evaluation of the reliability of the suggested approach, a number of experiments are conducted using KDD Cup 1999 data which is known to be a popular dataset and has been widely used by the researchers [39][40][41][42]. The results of experiments and their investigations are described in the subsequent part.…”
Section: Feature Selection Methods Ga-svmmentioning
confidence: 99%
“…Moreover, generalization of results obtained from Solarisbased system is not logical since Solaris-based system has limited usage in cyber community compared to Linux [25][26] . The most critical point in KDD collection is the existence of several data artifacts within it which make it not reliable [27] .…”
Section: Kdd Datasetsmentioning
confidence: 99%
“…It has been recommended for evaluating the performance of an anomalybased IDS in detecting new intrusions. Due to the reason that the primary concern to an anomaly-based IDS is its accuracy in modelling normal traffic behaviour of a network, the age of data does not prevent a fair evaluation on the system [43]. Moreover, testing our approach using KDD Cup 99 data set contributes convincing evaluations and comparisons with other related state-of-the-art techniques [11] [12].…”
Section: System Evaluationmentioning
confidence: 99%