2018
DOI: 10.1109/tpds.2018.2842224
|View full text |Cite
|
Sign up to set email alerts
|

A Self-Adaptive Network for HPC Clouds: Architecture, Framework, and Implementation

Abstract: Clouds offer flexible and economically attractive compute and storage solutions for enterprises. However, the effectiveness of cloud computing for high-performance computing (HPC) systems still remains questionable. When clouds are deployed on lossless interconnection networks, like InfiniBand (IB), challenges related to load-balancing, low-overhead virtualization, and performance isolation hinder full potential utilization of the underlying interconnect. Moreover, cloud data centers incorporate a highly dynam… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 35 publications
0
8
0
Order By: Relevance
“…This subsection validates the effectiveness of our proposed algorithms. We generate five network configurations, i.e., Galaxyfly (3,5,4), Galaxyfly (3,5,8), Galaxyfly(4, 5, 5), Galaxyfly (4,7,4), and Galaxyfly (4,7,5). The packet size, denoted by PS, varies among 160B, 320B, 640B, and 1280B.…”
Section: Validationmentioning
confidence: 99%
See 1 more Smart Citation
“…This subsection validates the effectiveness of our proposed algorithms. We generate five network configurations, i.e., Galaxyfly (3,5,4), Galaxyfly (3,5,8), Galaxyfly(4, 5, 5), Galaxyfly (4,7,4), and Galaxyfly (4,7,5). The packet size, denoted by PS, varies among 160B, 320B, 640B, and 1280B.…”
Section: Validationmentioning
confidence: 99%
“…In the case of exascale systems, for instance, a power envelope of 20-30 MW must be strictly adhered to in order to ensure low power consumption [7]. Third, the excellent network design should endow HPC systems with high flexibility so that HPC systems possess the ability to vary on different scales [8].…”
Section: Introductionmentioning
confidence: 99%
“…Resource allocation problem is omnipresent in cloud computing and many methods have been proposed to enhance resource utilization with rational resource provisioning and effective cost control. For example, Zahid et al [10] proposed a ruled-based language for CSPs, in order to improve the QoS compliance of high-performance computing (HPC) clouds. Through using probabilistic thresholds, a system model was designed in [23] for accomplishing the switching between different operating levels of cloud services.…”
Section: Classic Approaches For Cloud Resource Allocationmentioning
confidence: 99%
“…Many classic solutions for cloud resource allocation are based on rules [10], heuristics [11], and control theory [12]. Although these solutions can solve the problem of cloud resource allocation to some extent, they commonly use the prior knowledge of cloud systems (e.g., state transitions, demand changes, and energy consumptions) to develop corresponding strategies of resource allocation.…”
Section: Introductionmentioning
confidence: 99%
“…Maurer et al [9] designed a resource-efficient decision-making method in response to workload fluctuations, where a rule-based knowledge management strategy was proposed to achieve the autonomous reconstruction of VMs. Zahid et al [10] designed a ruled-based language for CSPs with adaptation schemes, in order to enhance the QoS compliance in highperformance computing (HPC) clouds. However, these two works did not regard the minimization of resource costs as an optimization goal, which may lead to excessive resource costs.…”
Section: A Literature Reviewmentioning
confidence: 99%