2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies 2008
DOI: 10.1109/pdcat.2008.14
|View full text |Cite
|
Sign up to set email alerts
|

Checkpointing Process Groups in a Grid Environment

Abstract: International audienceThe EU-funded XtreemOS project implements a grid operating system transparently exploiting resources of virtual organizations through the standard POSIX interface. Grid checkpointing and restart requires to save and restore jobs executing in a distributed heterogeneous grid environment. The latter may spawn millions of grid nodes ( PCs, clusters, and mobile devices ) using different system-specific checkpointers saving and restoring application and kernel data structures for processes exe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2009
2009
2012
2012

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…Since no separate UNIX session or UNIX process group is initiated at job submission, BLCR cannot use these two process group semantics. More information can be found under [18].…”
Section: Process Groupsmentioning
confidence: 99%
See 1 more Smart Citation
“…Since no separate UNIX session or UNIX process group is initiated at job submission, BLCR cannot use these two process group semantics. More information can be found under [18].…”
Section: Process Groupsmentioning
confidence: 99%
“…The kernel is still able to distinguish equally named identifiers used across multiple applications, by isolating resource groups and mapping each identifier to a unique one at kernel level. Integration of state-of-the-art lightweight virtualization mechanisms provided by mainline Linux into XtreemOS is in progress, see also [18].…”
Section: Resource Conflictsmentioning
confidence: 99%
“…Thus it provides a more flexible solution with advantages of both reduced runtime overhead and localized recovery effect. Recent reports have shown feasibility as well as efficiency of applying group checkpoints to certain MPI programs [13] and in specific grid computing environments [14].…”
Section: Introductionmentioning
confidence: 99%