2018
DOI: 10.1002/cpe.4412
|View full text |Cite
|
Sign up to set email alerts
|

Prospects and challenges of virtual machine migration in HPC

Abstract: Summary The continuous growth of supercomputers is accompanied by increased complexity of the intra‐node level and the interconnection topology. Consequently, the whole software stack ranging from the system software to the applications has to evolve, eg, by means of fault tolerance and support for the rising intra‐node parallelism. Migration techniques are one means to address these challenges. On the one hand, they facilitate the maintenance process by enabling the evacuation of individual nodes during runti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 47 publications
0
2
0
Order By: Relevance
“…The idea is to zero memory regions only when a migration or a checkpoint is imminent and thus to avoid unnecessary zero writes. The glibc library needs to be modified to react upon migration / checkpoint requests (similar to the approach for the migration of MPI applications in [32]). In doing so, it would place zeros into freed memory regions only directly before a migration starts.…”
Section: Generalization Of the Approachmentioning
confidence: 99%
“…The idea is to zero memory regions only when a migration or a checkpoint is imminent and thus to avoid unnecessary zero writes. The glibc library needs to be modified to react upon migration / checkpoint requests (similar to the approach for the migration of MPI applications in [32]). In doing so, it would place zeros into freed memory regions only directly before a migration starts.…”
Section: Generalization Of the Approachmentioning
confidence: 99%
“…Pickartz et al dealt with fault tolerance on a supercomputer on which one or several nodes may be turned off during a computation for maintenance reasons. The authors provided solutions to be able to efficiently and transparently stop some nodes while the HPC application is running.…”
Section: Hpc and Co: This Special Issuementioning
confidence: 99%