2009
DOI: 10.1007/978-3-642-03869-3_42
|View full text |Cite
|
Sign up to set email alerts
|

The Architecture of the XtreemOS Grid Checkpointing Service

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 15 publications
0
9
0
Order By: Relevance
“…Transparent integration of out-of-cluster compute resources into SSI clusters is not supported. However, the implementation of checkpointing and process group migration features allow jobs to be migrated from one cluster to another [109].…”
Section: Grid Integrationmentioning
confidence: 99%
“…Transparent integration of out-of-cluster compute resources into SSI clusters is not supported. However, the implementation of checkpointing and process group migration features allow jobs to be migrated from one cluster to another [109].…”
Section: Grid Integrationmentioning
confidence: 99%
“…1. XtreemOS Grid-Checkpointing Architecture [9] At the grid level, a job checkpointer service is in charge of managing the checkpoint/restart of a job, possibly spanning multiple grid nodes. It is located on the same node as the job manager and uses virtual nodes [11] to achieve service replication and thus high-availability.…”
Section: B Architecturementioning
confidence: 99%
“…Therefore, in order to achieve fault tolerance for distributed applications in grids, XtreemOS [8] deploys a heterogeneous grid checkpointing service called XtreemGCP [9]. This service is designed to support different checkpointing protocols and address the underlying grid-node checkpointers in a transparent manner through a uniform interface.…”
Section: Introductionmentioning
confidence: 99%
“…Sockets must also be closed and recreated on nodes whose checkpointers are incapable of handling open socket descriptors. Both tasks can be handled using the callback mechanism provided by XtreemGCP [13]. The latter explicitly executes registered callbacks before and after a checkpoint or after a restart.…”
Section: Gcc Architecturementioning
confidence: 99%
“…All these implementations come with different capabilities and there is no ultimate best checkpointer. Therefore, we have designed XtreemGCP -a grid checkpointing service capable of checkpointing and restarting a grid job running on nodes equipped with different checkpointing packages [13]. Each grid job consists of one or multiple job units.…”
Section: Introductionmentioning
confidence: 99%