2006
DOI: 10.1145/1163593.1163596
|View full text |Cite
|
Sign up to set email alerts
|

A preliminary performance comparison of five machine learning algorithms for practical IP traffic flow classification

Abstract: The identification of network applications through observation of associated packet traffic flows is vital to the areas of network management and surveillance. Currently popular methods such as port number and payload-based identification exhibit a number of shortfalls. An alternative is to use machine learning (ML) techniques and identify network applications based on per-flow statistics, derived from payload-independent features such as packet length and inter-arrival time distributions. The performance impa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

3
313
1
1

Year Published

2009
2009
2021
2021

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 586 publications
(319 citation statements)
references
References 8 publications
3
313
1
1
Order By: Relevance
“…In [45], Williams et al compared three different Bayesian algorithms (Naïve Bayes with kernel density estimation, Naïve Bayes with discretization, Bayesian network and Naïve Bayes tree algorithms) with a deterministic machine learning algorithm (C4.5 decision tree). All of these classifiers were implemented from the WEKA toolbox.…”
Section: Probabilistic Machine Learning Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…In [45], Williams et al compared three different Bayesian algorithms (Naïve Bayes with kernel density estimation, Naïve Bayes with discretization, Bayesian network and Naïve Bayes tree algorithms) with a deterministic machine learning algorithm (C4.5 decision tree). All of these classifiers were implemented from the WEKA toolbox.…”
Section: Probabilistic Machine Learning Methodsmentioning
confidence: 99%
“…The optimal subset of features excludes the redundant features that are not relevant for classification. Williams et al compare these two methods extensively in [45].…”
Section: Feature Selectionmentioning
confidence: 99%
See 2 more Smart Citations
“…Previous research showed that for classification of network traffic the better ML techniques provide similar accuracy, but differ greatly regarding training time and classification speed [15]. We used the C4.5 decision tree classifier [16] -more precisely its implementation in the Waikato Environment for Knowledge Analysis (WEKA) [17], because it had performed well previously [15].…”
Section: Machine Learningmentioning
confidence: 99%