Grenville Armitage scite author profile

The research community has begun looking for IP traffic classification techniques that do not rely on 'well known' TCP or UDP port numbers, or interpreting the contents of packet payloads. New work is emerging on the use of statistical traffic characteristics to assist in the identification and classification process. This survey paper looks at emerging research into the application of Machine Learning (ML) techniques to IP traffic classification, an inter-disciplinary blend of IP networking and data mining techniques. We provide context and motivation for the application of ML techniques to IP traffic classification, and review 18 significant works that cover the dominant period from 2004 to early 2007. These works are categorized and reviewed according to their choice of ML strategies and primary contributions to the literature. We also discuss a number of key requirements for the employment of ML-based traffic classifiers in operational IP networks, and qualitatively critique the extent to which the reviewed works meet these requirements. Open issues and challenges in the field are also discussed.

A survey of covert channels and countermeasures in computer network protocols

Zander

Armitage

Branch

2007

IEEE Commun. Surv. Tutorials

455

370

Covert channels are used for the secret transfer of information. Encryption only protects communication from being decoded by unauthorised parties, whereas covert channels aim to hide the very existence of the communication. Initially, covert channels were identified as a security threat on monolithic systems, i.e., mainframes. More recently, focus has shifted towards covert channels in computer network protocols. The huge amount of data and vast number of different protocols in the Internet seem ideal as a high-bandwidth vehicle for covert communication. This article is a survey of the existing techniques for creating covert channels in widely deployed network and application protocols. We also give an overview of common methods for their detection, elimination, and capacity limitation, required to improve security in future computer networks.

A preliminary performance comparison of five machine learning algorithms for practical IP traffic flow classification

Williams

Zander

Armitage

2006

SIGCOMM Comput. Commun. Rev.

585

313

The identification of network applications through observation of associated packet traffic flows is vital to the areas of network management and surveillance. Currently popular methods such as port number and payload-based identification exhibit a number of shortfalls. An alternative is to use machine learning (ML) techniques and identify network applications based on per-flow statistics, derived from payload-independent features such as packet length and inter-arrival time distributions. The performance impact of feature set reduction, using Consistency-based and Correlation-based feature selection, is demonstrated on Naïve Bayes, C4.5, Bayesian Network and Naïve Bayes Tree algorithms. We then show that it is useful to differentiate algorithms based on computational performance rather than classification accuracy alone, as although classification accuracy between the algorithms is similar, computational performance can differ significantly.
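
As a rough illustration of the "per-flow statistics" the abstract refers to, the sketch below derives a few payload-independent features from packet lengths and inter-arrival times. It is not taken from the paper: the function name `flow_features`, the `(timestamp, length)` packet representation, and the particular statistics chosen are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def flow_features(packets):
    """Summarise one flow using payload-independent statistics.

    `packets` is a hypothetical representation: a list of
    (timestamp_seconds, length_bytes) tuples in arrival order.
    """
    if not packets:
        raise ValueError("flow has no packets")

    lengths = [length for _, length in packets]
    times = [t for t, _ in packets]
    # Inter-arrival times are the gaps between consecutive packets.
    gaps = [later - earlier for earlier, later in zip(times, times[1:])]

    return {
        "pkt_count": len(lengths),
        "len_min": min(lengths),
        "len_max": max(lengths),
        "len_mean": mean(lengths),
        "len_std": pstdev(lengths),
        # A single-packet flow has no gaps, so fall back to 0.0.
        "iat_mean": mean(gaps) if gaps else 0.0,
        "iat_std": pstdev(gaps) if gaps else 0.0,
    }
```

A feature dictionary of this kind would then be handed to whichever classifier is being evaluated; the feature selection step described in the abstract would prune it further.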

Automated traffic classification and application identification using machine learning

2005

A Survey of Rate Adaptation Techniques for Dynamic Adaptive Streaming Over HTTP

Kua

Armitage

Branch

2017

IEEE Commun. Surv. Tutorials

232

113

An experimental estimation of latency sensitivity in multiplayer Quake 3

Armitage

138

101

Quantifying the latency sensitivity of potential customers/players is critical for Internet-based game providers when planning the network placement of their game servers. In early 2001 we placed two Quake 3 servers at different locations on the Internet, and instrumented them to gather median latency information on every player who played over a multi-month period. Comparison of server logfiles showed an active yet distinct player population on each server, and the median latency distributions suggest players actively prefer Quake 3 servers less than 150 to 180 milliseconds from the player's location. Quake 3 is often played as a multiplayer, Internet-based, highly interactive "first person shooter" game. Although Quake 3 is nowhere near as popular as games such as Half-Life: Counter-Strike, we believe our results provide a useful ballpark indicator of latency sensitivity for this class of highly interactive online games.

Training on multiple sub-flows to optimise the use of Machine Learning classifiers in real-world IP networks

Nguyễn

Armitage

2006

Literature on the use of machine learning (ML) algorithms for classifying IP traffic has relied on full flows or the first few packets of flows. In contrast, many real-world scenarios require a classification decision well before a flow has finished, even if the flow's beginning is lost. This implies classification must be achieved using statistics derived from the most recent N packets taken at any arbitrary point in a flow's lifetime. We propose training the classifier on a combination of short sub-flows (extracted from full-flow examples of the target application's traffic). We demonstrate this optimisation using the Naïve Bayes ML algorithm, and show that our approach results in excellent performance even when classification is initiated mid-way through a flow with windows as small as 25 packets long. We suggest future use of unsupervised ML algorithms to identify optimal sub-flows for training.
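
The idea of training on short windows taken from full-flow examples can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' procedure: the helper names `extract_sub_flows` and `training_rows`, the evenly spaced windows, the `(timestamp, length)` packet tuples, and the single mean-length feature are all hypothetical, and the abstract does not say how the sub-flows were actually selected.

```python
def extract_sub_flows(packets, window=25, step=25):
    """Cut a full flow into windows of `window` consecutive packets.

    Evenly spaced windows are an assumption made for illustration only.
    Flows shorter than `window` yield no sub-flows.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    last_start = len(packets) - window
    return [packets[s:s + window] for s in range(0, last_start + 1, step)]


def training_rows(labelled_flows, window=25, step=25):
    """Build (feature, label) rows from every sub-flow of every labelled flow.

    `labelled_flows` is a hypothetical list of (label, packets) pairs, where
    packets are (timestamp_seconds, length_bytes) tuples in arrival order.
    """
    rows = []
    for label, packets in labelled_flows:
        for sub in extract_sub_flows(packets, window, step):
            lengths = [length for _, length in sub]
            # One toy feature per sub-flow: mean packet length in the window.
            rows.append((sum(lengths) / len(lengths), label))
    return rows
```

Because each row describes a window rather than a whole flow, a classifier trained on such rows sees examples resembling what it would observe when classification starts mid-flow.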

A Framework for IP Based Virtual Private Networks

Gleeson

Lin

Heinänen

et al. 2000

101
