2016
DOI: 10.1007/s10766-016-0434-5
|View full text |Cite
|
Sign up to set email alerts
|

Scalable Loop Self-Scheduling Schemes for Large-Scale Clusters and Cloud Systems

Abstract: Cloud systems have demonstrated the powerful computation and storage capability in many scientific applications. In this paper, we propose a class of scalable distributed loop self-scheduling schemes to achieve good load balancing and scalability. We implemented these schemes on a large-scale cluster and on a heterogeneous cloud system. The schemes consider the distribution of the output data, which can help reduce communication overhead and improve scalability. We evaluated the schemes using four scientific c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…So far, several loop scheduling strategies have been introduced to address Loop Scheduling Problem (LSP), [3][4][5][6][7][8][9] and they mainly rely on two techniques. In the first one, called on-demand scheduling, iterations are assigned on-the-fly at runtime, so that load imbalance and runtime variations may be dynamically handled.…”
Section: Introductionmentioning
confidence: 99%
“…So far, several loop scheduling strategies have been introduced to address Loop Scheduling Problem (LSP), [3][4][5][6][7][8][9] and they mainly rely on two techniques. In the first one, called on-demand scheduling, iterations are assigned on-the-fly at runtime, so that load imbalance and runtime variations may be dynamically handled.…”
Section: Introductionmentioning
confidence: 99%
“…Load balancing by considering the current status of all the available resources will solve the problem of inefficient utilization of resources. A scalable distributed loop self‐scheduling scheme is a load balancing method with reduced communication overhead. Although the system is scalable, it is only for the homogeneous clusters.…”
Section: Related Workmentioning
confidence: 99%