Cloud computing based systems, that span data centers, are commonly deployed to offer high performance for user service requests. As data centers continue to expand, computer architects and system designers are facing many challenges on how to balance resource utilization efficiency, server and network performance, energy consumption and quality-of-service (QoS) demands from the users.To develop effective data center management policies, it becomes essential to have an in-depth understanding and synergistic control of the various sub-components inside large scale computing systems, that include both computation and communication resources. Prior studies on performance and energy issues in data centers largely focus on either servers or the network and completely ignore issues relating to the other components, or consider only high level analytical models without sufficient detail, which can lead to non-optimal solutions. Unfortunately, it is prohibitively expensive or in some cases even impossible to have complete access to an operational large-scale computing system (e.g., production server farms). Therefore, a comprehensive simulation infrastructure that models all major hardware and system components, and offers interfaces to manage the interplay between both computation and communication resources are critical in advancing future research for more effective performance and energy optimization in data centers.In this paper, we propose HolDCSim, a light-weight, holistic, extensible, event-driven data center simulation platform that effectively models both server and network architectures. HolD-CSim can be used in a variety of data center system studies including job/task scheduling, resource provisioning, global and local server farm power management, and network and server performance analysis. We demonstrate the design of our simulation infrastructure, and illustrate the usefulness of our framework with several case studies that analyze server/network performance and energy efficiency. We also perform validation on real machines to verify our simulator.HolDCSim models a complete data center infrastructure by modeling network devices (e.g., switches) and interconnection among various nodes in the system (i.e., topologies). The network module creates a complete topology by connecting the switches and servers with network links. Network communication is modeled at two levels of granularity: packetbased communication and flow-based communication. Each
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.