2021
DOI: 10.48550/arxiv.2110.05977
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Datasets are not Enough: Challenges in Labeling Network Traffic

Abstract: In contrast to previous surveys, the present work is not focused on reviewing the datasets used in the network security field. The fact is that many of the available public labeled datasets represent the network behavior just for a particular time period. Given the rate of change in malicious behavior and the serious challenge to label, and maintain these datasets, they become quickly obsolete. Therefore, this work is focused on the analysis of current labeling methodologies applied to network-based data. In t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 57 publications
(141 reference statements)
0
1
0
Order By: Relevance
“…The characteristics of obtained data are one of the challenges to tackle to achieve successful deployment of ML-based NIDS. Researchers often use benchmark datasets which contain features unobtainable in real-time [51] [25]. Flow-based features provide a useful overview of the activity of a network [1] [24].…”
Section: Methods 31 Flow-based Datamentioning
confidence: 99%
“…The characteristics of obtained data are one of the challenges to tackle to achieve successful deployment of ML-based NIDS. Researchers often use benchmark datasets which contain features unobtainable in real-time [51] [25]. Flow-based features provide a useful overview of the activity of a network [1] [24].…”
Section: Methods 31 Flow-based Datamentioning
confidence: 99%