2021 14th International Conference on Security of Information and Networks (SIN) 2021
DOI: 10.1109/sin54109.2021.9699293
|View full text |Cite
|
Sign up to set email alerts
|

Improving Intrusion Detection Through Training Data Augmentation

Abstract: Imbalanced classes in datasets are common problems often found in security data. Therefore, several strategies like class resampling and cost-sensitive training have been proposed to address it. In this paper, we propose a data augmentation strategy to oversample the minority classes in the dataset. Using our Sort-Augment-Combine (SAC) technique, we split the dataset into subsets of the class labels and then generate synthetic data from each of the subsets. The synthetic data were then used to oversample the m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 27 publications
0
2
0
1
Order By: Relevance
“…This is because the number of observations of the other classes are lower than the number of observations in the majority class. As a result, we augmented the dataset by oversampling the minority classes to the number of the majority class by using the approach by [18]. The number of observations in the benign class was used as the baseline for the oversampling of the minority classes.…”
Section: A Gas Pipeline Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…This is because the number of observations of the other classes are lower than the number of observations in the majority class. As a result, we augmented the dataset by oversampling the minority classes to the number of the majority class by using the approach by [18]. The number of observations in the benign class was used as the baseline for the oversampling of the minority classes.…”
Section: A Gas Pipeline Datasetmentioning
confidence: 99%
“…As shown in Table III, the distribution is heavily skewed towards the Normal class, resulting in a highly imbalanced dataset. In light of this, we augmented the minority classes using the approach proposed by [18]. Also, the number of observations in the benign class formed the threshold for oversampling the minority classes.…”
Section: B Water Tank Datasetmentioning
confidence: 99%
“…Nos últimos anos, as técnicas de aumento de dados têm sido cada vez mais utilizadas para fins de construc ¸ão de modelos de AM. Por exemplo, U. Otokwala et al [Otokwala et al 2021] propuseram uma abordagem de aumento de dados para equilibrar a ocorrência de tráfego de rede durante a construc ¸ão do modelo, aumentando a taxa de detecc ¸ão, mas desconsiderando a sua aplicac ¸ão para atualizac ¸ões do modelo. Um aumento do conjunto de dados baseado em GAN foi proposto por G. Andresini et al [Andresini et al 2021] para lidar com o desequilíbrio de classes.…”
Section: Trabalhos Relacionadosunclassified