Heavy-Hitters (HHs) are large-volume flows that consume considerably more network resources than other flows combined. In SDN-based DCNs (SDDCNs), HHs cause non-trivial delays for small-volume flows known as non-HHs that are delay-sensitive. Uncontrolled forwarding of HHs leads to network congestion and overall network performance degradation. A pivotal task for controlling HHs is their identification. The existing methods to identify HHs are threshold-based. However, such methods lack a smart system that efficiently identifies HH according to the network behaviour. In this paper, we introduce a novel approach to overcome this lack and investigate the feasibility of using Knowledge-Defined Networking (KDN) in HH identification. KDN by using Machine Learning (ML), allows integrating behavioural models to detect patterns, like HHs, in SDN traffic. Our KDN-based approach includes mainly three modules: HH Data Acquisition Module (HH-DAM), Data ANalyser Module (HH-DANM), and APplication Module (HH-APM). In HH-DAM, we present the flowRecorder tool for organizing packets into flows records. In HH-DANM, we perform a cluster-based analysis to determine an optimal threshold for separating HHs and non-HHs. Finally, in HH-APM, we propose the use of MiceDCER for routing non-HHs efficiently. The per-module evaluation results corroborate the usefulness and feasibility of our approach for identifying HHs.
A heavy-hitter (HH) network traffic flow consumes considerably more network resources than other flows combined. The classification of HHs is critical to provide, among others, the required level of Quality of Service and reliability in both conventional and data center networks. HH classification is typically threshold-based. However, there is no consistent and accepted threshold or set of thresholds that would reliably classify flows. Furthermore, existing threshold-driven approaches use counters (e.g., duration, packets, and bytes); thus, their accuracy depends on how complete the flow information is. This paper paves the way to threshold-agnostic HH identification by proposing an approach that performs HH classification based on per-flow packet size distribution (PSD) and template matching (TM). PSD allows capturing the behavior and dynamism of network traffic flows (even from their first few packets). TM enables to classify HHs by measuring the similarity between the PSD of observed flows and a set of master templates representing the flow size behavior of HH classes. We evaluated the PSD-and TM-based approach using flows extracted from real traffic traces. Results show that our approach classifies HHs accurately and timely, corroborating that the threshold-less perspective is feasible for HH identification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.