2020
DOI: 10.3390/sym12101666
|View full text |Cite
|
Sign up to set email alerts
|

An Analysis of the KDD99 and UNSW-NB15 Datasets for the Intrusion Detection System

Abstract: The significant increase in technology development over the internet makes network security a crucial issue. An intrusion detection system (IDS) shall be introduced to protect the networks from various attacks. Even with the increased amount of works in the IDS research, there is a lack of studies that analyze the available IDS datasets. Therefore, this study presents a comprehensive analysis of the relevance of the features in the KDD99 and UNSW-NB15 datasets. Three methods were employed: a rough-set theory (… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
31
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 66 publications
(31 citation statements)
references
References 34 publications
0
31
0
Order By: Relevance
“…While the model proposed by Koroniotis et al (2017) achieved 93.23% accuracy using DT classifier. In addition, none of the studies listed in Table 1 have resolved the class imbalance problem of the UNSW-NB15 dataset as there are many studies ( Al-Daweri et al, 2020 ; Ahmad et al, 2021 ; Bagui & Li, 2021 ; Dlamini & Fahim, 2021 ) that have highlighted this issue. We addressed the class imbalance problem by applying SMOTE that improved the performance of the classifiers and achieved good results.…”
Section: Discussionmentioning
confidence: 99%
“…While the model proposed by Koroniotis et al (2017) achieved 93.23% accuracy using DT classifier. In addition, none of the studies listed in Table 1 have resolved the class imbalance problem of the UNSW-NB15 dataset as there are many studies ( Al-Daweri et al, 2020 ; Ahmad et al, 2021 ; Bagui & Li, 2021 ; Dlamini & Fahim, 2021 ) that have highlighted this issue. We addressed the class imbalance problem by applying SMOTE that improved the performance of the classifiers and achieved good results.…”
Section: Discussionmentioning
confidence: 99%
“…The Kaggle version of the KDD Cup 99 [30], [31] is available an online dataset. The dataset is composed by a total of 25192 TCP/IP connections (observations) from a simulated typical US Air Force LAN.…”
Section: A Kdd99mentioning
confidence: 99%
“…There are a number of attractive datasets for network intrusion detection investigating. For instance, KDD 99 is one of the most utilized in studying network IDS [12,47,75]. The KDD 99 is public, that is a benchmark to evaluate performance between provided approached.…”
Section: Lack Of Dedicated Apt Network Intrusion Datasetmentioning
confidence: 99%