2011
DOI: 10.1007/978-3-642-24669-2_11
|View full text |Cite
|
Sign up to set email alerts
|

SpotMPI: A Framework for Auction-Based HPC Computing Using Amazon Spot Instances

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0
1

Year Published

2013
2013
2023
2023

Publication Types

Select...
4
4
2

Relationship

2
8

Authors

Journals

citations
Cited by 29 publications
(18 citation statements)
references
References 16 publications
0
15
0
1
Order By: Relevance
“…Their later research employs fault tolerance such as checkpointing, task duplication, and migration [25]. SpotMPI [26] facilitates execution of MPI applications on auction-based Cloud platforms based on adjusted optimal checkpoint-restart (CPR) intervals. Their tookit can automate checkpointing at bidding price and restart application after interruption.…”
Section: Related Workmentioning
confidence: 99%
“…Their later research employs fault tolerance such as checkpointing, task duplication, and migration [25]. SpotMPI [26] facilitates execution of MPI applications on auction-based Cloud platforms based on adjusted optimal checkpoint-restart (CPR) intervals. Their tookit can automate checkpointing at bidding price and restart application after interruption.…”
Section: Related Workmentioning
confidence: 99%
“…While a set of published projects are setting the tone for future developments in terms of resource planning and management [37], since the problem space is so large, we still expect many new approaches to meet the HPC sustainability challenges: performance, reliability, scalability, intercloud operability, and usability of virtual clusters all at the same time.…”
Section: Challenges Of Virtual Cluster Managementmentioning
confidence: 99%
“…While these research efforts have been successful at helping clarify many points about spot instances, only a few, such as [22], have touched on HPC applications and how the difference between high performance and high throughput applications impacts the fault tolerance strategies used to mitigate the interruptions due to market fluctuations. While describing strategies for single applications is crucial to the understanding of spot resources and auction-based computing in general, this knowledge is not fully transferable to HPC computing and there are many differences when applications are meant to scale to higher orders of magnitudes.…”
Section: Previous Work On Spot Instancesmentioning
confidence: 99%