2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems 2011
DOI: 10.1109/mascots.2011.27

Analyzing and Improving MPI Communication Performance in Overcommitted Virtualized Systems

Abstract: Nowadays, it is an important trend in the system domain to use the software-based virtualization technology to build the execution environments (e.g., Clouds) and serve high performance computing (HPC) applications. However, with the extra virtualization layer, the application performance may be negatively affected. Studies revealed that the communication performance of the MPI library, which is widely used by the HPC applications, would suffer a high penalty when a physical host machine becomes overc…

Citations: cited by 8 publications (2 citation statements)
References: 15 publications
“…We choose three benchmark programs (called is, ep, and lu) from NPB suite of version 2.4 [15], as they exhibit three typical types of parallel executions: communication intensive application with little computation (is); CPU intensive application with little communication (ep); and the one that lies in between them (lu) [16]. For each VM, the number of packets and that of spinlocks are recorded every 120 milliseconds (multiplying the 30ms of Xen Credit scheduler by the number of VMs in this experiment) over 60 seconds.…”
Section: Basic Idea (mentioning)
confidence: 99%
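
The sampling arithmetic in the statement above can be checked with a short sketch. The three constants are the values quoted in the statement; the function names are hypothetical, and the VM count is only inferred from the quoted interval (120 ms divided by the 30 ms Credit scheduler time slice), since the excerpt does not state it directly.

```python
# Values quoted in the citation statement.
CREDIT_TIME_SLICE_MS = 30    # Xen Credit scheduler time slice
SAMPLING_INTERVAL_MS = 120   # interval between recordings
OBSERVATION_WINDOW_S = 60    # total recording time


def implied_vm_count(interval_ms: int, slice_ms: int) -> int:
    """Number of VMs implied by interval = slice * number_of_vms."""
    if interval_ms % slice_ms != 0:
        raise ValueError("interval is not a whole multiple of the time slice")
    return interval_ms // slice_ms


def samples_per_vm(window_s: int, interval_ms: int) -> int:
    """Number of complete sampling intervals that fit in the window."""
    return (window_s * 1000) // interval_ms


vm_count = implied_vm_count(SAMPLING_INTERVAL_MS, CREDIT_TIME_SLICE_MS)
sample_count = samples_per_vm(OBSERVATION_WINDOW_S, SAMPLING_INTERVAL_MS)
```

Under these assumptions, each VM contributes one packet count and one spinlock count per interval for the whole window.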
“…Four functions are used to get a suitable VCPU when runnable VCPUs include parallel ones (lines 8-19). First, it calls get_parallel_vcpus_with_maxPackets() to obtain VCPUs, which are put in vcpuSet1, from run queue runq if parallel VMs they belongs to receive maximum packets since their last de-scheduled (line 8).…”
Section: Svs Algorithm (mentioning)
confidence: 99%
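
The first selection step quoted above could look roughly like the following sketch. It is not the citing paper's implementation: the `Vcpu` record, its field names, and the snake_case function name are assumptions made for illustration, and the remaining three functions are not described in the excerpt, so they are not shown.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Vcpu:
    # Hypothetical record; the excerpt does not define the data layout.
    vcpu_id: int
    vm_id: int
    vm_is_parallel: bool                # VM runs a parallel application
    vm_packets_since_descheduled: int   # packets the VM received since it last ran


def get_parallel_vcpus_with_max_packets(runq: List[Vcpu]) -> List[Vcpu]:
    """Return runnable VCPUs whose parallel VM received the most packets."""
    parallel = [v for v in runq if v.vm_is_parallel]
    if not parallel:
        return []
    max_packets = max(v.vm_packets_since_descheduled for v in parallel)
    # Keep every VCPU that ties for the maximum, preserving run-queue order.
    return [v for v in parallel if v.vm_packets_since_descheduled == max_packets]


runq = [
    Vcpu(0, vm_id=1, vm_is_parallel=True, vm_packets_since_descheduled=10),
    Vcpu(1, vm_id=1, vm_is_parallel=True, vm_packets_since_descheduled=10),
    Vcpu(2, vm_id=2, vm_is_parallel=True, vm_packets_since_descheduled=4),
    Vcpu(3, vm_id=3, vm_is_parallel=False, vm_packets_since_descheduled=50),
]
vcpu_set1 = get_parallel_vcpus_with_max_packets(runq)
```

Non-parallel VMs are ignored even when they received more packets, which matches the statement's restriction to parallel VMs.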