2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing 2014
DOI: 10.1109/ccgrid.2014.30
|View full text |Cite
|
Sign up to set email alerts
|

Achieving Efficient Distributed Scheduling with Message Queues in the Cloud for Many-Task Computing and High-Performance Computing

Abstract: Abstract-Task scheduling and execution over large scale, distributed systems plays an important role on achieving good performance and high system utilization. Due to the explosion of parallelism found in today's hardware, applications need to perform over-decomposition to deliver good performance; this over-decomposition is driving job management systems' requirements to support applications with a growing number of tasks with finer granularity. Our goal in this work is to provide a compact, light-weight, sca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 21 publications
(18 citation statements)
references
References 29 publications
0
18
0
Order By: Relevance
“…This mechanism suffers longtail problem under heterogeneous workloads [34] due to early binding of tasks to worker resources. We have compared Sparrow and the basic MATRIX without data-aware scheduling technique using heterogeneous workloads in [39], and MATRIX outperforms Sparrow by 9X. Furthermore, there is an implementation barrier with Sparrow as it is developed in Java, which has little support in high-end computing systems.…”
Section: E Different Benchmark Dagsmentioning
confidence: 99%
See 1 more Smart Citation
“…This mechanism suffers longtail problem under heterogeneous workloads [34] due to early binding of tasks to worker resources. We have compared Sparrow and the basic MATRIX without data-aware scheduling technique using heterogeneous workloads in [39], and MATRIX outperforms Sparrow by 9X. Furthermore, there is an implementation barrier with Sparrow as it is developed in Java, which has little support in high-end computing systems.…”
Section: E Different Benchmark Dagsmentioning
confidence: 99%
“…CloudKon [39] has similar architecture as MATRIX, except that CloudKon focuses on the Cloud environment, and relies on the Cloud services, SQS [40] to do distributed load balancing, and DynamoDB [41] as the DKVS to keep task metadata. Relying on the Cloud services could facilitate the easier development, at the cost of potential loss of performance and control.…”
Section: E Different Benchmark Dagsmentioning
confidence: 99%
“…This is all done on atomic ba time a copy of message is locked in the fron the message is successfully delivered to the if router didn't get the message from the will get it from replica node. Compared t our system offers exactly one delivery exactly one delivery functionality in Ama DynamoDB as used in CloudKon, the per Amazon SQS decreases by 30% [8]. W service bus provides At Least Once Proce failure.…”
Section: Refinements 1)mentioning
confidence: 99%
“…A of repeated message we still will be hav messages. So if we want to stop these re from SQS, we can use DyanmoDB for han delivery of message but it will probab performance of the whole system by 30 CloudKon [8].…”
Section: Refinements 1)mentioning
confidence: 99%
“…In recent years, driven by the rapid development of computer hardware resources, the physical resources (including CPU, network, storage, I / O devices, VMware as the representative) virtualization technology has made considerable progress, and provides a solid foundation for the rapid development of cloud computing [4].…”
Section: Introductionmentioning
confidence: 99%