2016
DOI: 10.1016/j.procs.2016.05.294
|View full text |Cite
|
Sign up to set email alerts
|

Portable Application-level Checkpointing for Hybrid MPI-OpenMP Applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
19
0
2

Year Published

2018
2018
2024
2024

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(21 citation statements)
references
References 5 publications
0
19
0
2
Order By: Relevance
“…As will be discussed in the present section, the application-level checkpoint and restart (ALCR) mechanism is the most effective mechanism for building software applications that are fault tolerant from the beginning [37][38][39]. However, since it is based on the deliberate insertion of checkpoints into the source code, it requires significant expertise and development effort.…”
Section: Optimum Checkpoint Recommendationmentioning
confidence: 99%
See 4 more Smart Citations
“…As will be discussed in the present section, the application-level checkpoint and restart (ALCR) mechanism is the most effective mechanism for building software applications that are fault tolerant from the beginning [37][38][39]. However, since it is based on the deliberate insertion of checkpoints into the source code, it requires significant expertise and development effort.…”
Section: Optimum Checkpoint Recommendationmentioning
confidence: 99%
“…Checkpoint and rollback/recovery is one of the most widely-used mechanisms for adding fault tolerance to software applications [37][38][39]. It was originally developed for enhancing the reliability of transaction-oriented computer systems (e.g.…”
Section: Transaction-oriented Systems and Optimum Checkpoint Intervalmentioning
confidence: 99%
See 3 more Smart Citations