Cloud computing based systems, that span data centers, are commonly deployed to offer high performance for user service requests. As data centers continue to expand, computer architects and system designers are facing many challenges on how to balance resource utilization efficiency, server and network performance, energy consumption and quality-of-service (QoS) demands from the users.To develop effective data center management policies, it becomes essential to have an in-depth understanding and synergistic control of the various sub-components inside large scale computing systems, that include both computation and communication resources. Prior studies on performance and energy issues in data centers largely focus on either servers or the network and completely ignore issues relating to the other components, or consider only high level analytical models without sufficient detail, which can lead to non-optimal solutions. Unfortunately, it is prohibitively expensive or in some cases even impossible to have complete access to an operational large-scale computing system (e.g., production server farms). Therefore, a comprehensive simulation infrastructure that models all major hardware and system components, and offers interfaces to manage the interplay between both computation and communication resources are critical in advancing future research for more effective performance and energy optimization in data centers.In this paper, we propose HolDCSim, a light-weight, holistic, extensible, event-driven data center simulation platform that effectively models both server and network architectures. HolD-CSim can be used in a variety of data center system studies including job/task scheduling, resource provisioning, global and local server farm power management, and network and server performance analysis. We demonstrate the design of our simulation infrastructure, and illustrate the usefulness of our framework with several case studies that analyze server/network performance and energy efficiency. We also perform validation on real machines to verify our simulator.HolDCSim models a complete data center infrastructure by modeling network devices (e.g., switches) and interconnection among various nodes in the system (i.e., topologies). The network module creates a complete topology by connecting the switches and servers with network links. Network communication is modeled at two levels of granularity: packetbased communication and flow-based communication. Each