2018 IEEE International Conference on Information Reuse and Integration (IRI) 2018
DOI: 10.1109/iri.2018.00041
|View full text |Cite
|
Sign up to set email alerts
|

Tackling Class Imbalance in Cyber Security Datasets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 27 publications
(17 citation statements)
references
References 10 publications
0
14
0
Order By: Relevance
“…21 Wheelus et al showed that the pre-processing of datasets to address class imbalance does indeed provide some benefit using UNSW-NB15 dataset. 23 Pattawaro and Polprasert obtained 86.36% detection rate with the proposed hybrid model that uses combination of feature selection, K-means and XGBoost methods using NSL-KDD dataset. 24 Mirza developed an ensemble method that combines logistic regression, neural networks, and decision trees and obtained 96.14% detection rate using KDDCup99 dataset.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…21 Wheelus et al showed that the pre-processing of datasets to address class imbalance does indeed provide some benefit using UNSW-NB15 dataset. 23 Pattawaro and Polprasert obtained 86.36% detection rate with the proposed hybrid model that uses combination of feature selection, K-means and XGBoost methods using NSL-KDD dataset. 24 Mirza developed an ensemble method that combines logistic regression, neural networks, and decision trees and obtained 96.14% detection rate using KDDCup99 dataset.…”
Section: Related Workmentioning
confidence: 99%
“…Various methods have been proposed in the literature for network anomaly detection including standard machine learning classifiers 4–29 and deep learning techniques 30–47 . Muda et al performed clustering before classification and compared the single classifiers with hybrid classifiers.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The class imbalance problem in datasets has been investigated in the literature by some researchers. Wheelus et al [20] used bagging, undersampling and synthetic minority oversampling on the UNSW-NB15 dataset [15] to tackle the problem. Their experiments on this network-flow-only dataset have shown significant performance improvement in terms of classification accuracy.…”
Section: Class Imbalancementioning
confidence: 99%
“…Real-world data from the domain of medical [22], text [36], software defect prediction [2], and fraud detection [31] often have significant imbalance between target classes. In a binary classification dataset with class-imbalance, the class with more instances and the class with fewer instances are known as the majority and the minority class respectively.…”
Section: Introductionmentioning
confidence: 99%