2013 IEEE International Conference on Cluster Computing (CLUSTER) 2013
DOI: 10.1109/cluster.2013.6702626
|View full text |Cite
|
Sign up to set email alerts
|

Checkpoint-restart for a network of virtual machines

Abstract: Abstract-The ability to easily deploy parallel computations on the Cloud is becoming ever more important. The first uniform mechanism for checkpointing a network of virtual machines is described. This is important for the parallel versions of common productivity software. Potential examples of parallelism include Simulink for MATLAB, parallel R for the R statistical modelling language, parallel blast.py for the BLAST bioinformatics software, IPython.parallel for Python, and GNU parallel for parallel shells. Th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(11 citation statements)
references
References 14 publications
(15 reference statements)
0
10
0
Order By: Relevance
“…(See Section 8.1 for a discussion of MPI checkpoint-restart services.) Kangarlou et al [KEX12] and Garg et al [GSJC13] each show how to take a distributed snapshot of VMs. Kangarlou et al base this on a modification of Xen's live migration, while Garg et al employ DMTCP to take a distributed snapshot of KVM/QEMU VMs.…”
Section: Mechanisms Based On Vm Snapshotsmentioning
confidence: 99%
“…(See Section 8.1 for a discussion of MPI checkpoint-restart services.) Kangarlou et al [KEX12] and Garg et al [GSJC13] each show how to take a distributed snapshot of VMs. Kangarlou et al base this on a modification of Xen's live migration, while Garg et al employ DMTCP to take a distributed snapshot of KVM/QEMU VMs.…”
Section: Mechanisms Based On Vm Snapshotsmentioning
confidence: 99%
“…The process migration indeed is very much used in bigdata business process. The customers in today's world relay in the digital content and the application process that supports and manages these need to be managed with better efficient and in convenient ways [3,4]. In [5,6], the process of checkpoint and restarting the process with the available resource to provide an uninterrupted service the author suggest the method of check point and restart.…”
Section: Introductionmentioning
confidence: 99%
“…(See Section 2.4 for a discussion of MPI checkpoint-restart services.) Kangarlou et al [51] and Garg et al [39] each show how to take a distributed snapshot of VMs. Kangarlou et al base this on a modification of Xen's live migration, while Garg et al employ DMTCP to take a distributed snapshot of KVM/QEMU VMs.…”
Section: Fault Tolerance and Efficiency In The Cloudmentioning
confidence: 99%
“…Kangarlou et al base this on a modification of Xen's live migration, while Garg et al employ DMTCP to take a distributed snapshot of KVM/QEMU VMs. The last three investigations ( [61], [51], [39]) contrasts with the cloud-agnostic (and applicationagnostic) approach employed here by directly checkpointing the processes along with their network connections.…”
Section: Fault Tolerance and Efficiency In The Cloudmentioning
confidence: 99%
See 1 more Smart Citation