2020 IEEE International Symposium on Workload Characterization (IISWC) 2020
DOI: 10.1109/iiswc50251.2020.00015
|View full text |Cite
|
Sign up to set email alerts
|

MATCH: An MPI Fault Tolerance Benchmark Suite

Abstract: MPI has been ubiquitously deployed in flagship HPC systems aiming to accelerate distributed scientific applications running on tens of hundreds of processes and compute nodes. Maintaining the correctness and integrity of MPI application execution is critical, especially for safety-critical scientific applications. Therefore, a collection of effective MPI fault tolerance techniques have been proposed to enable MPI application execution to efficiently resume from system failures. However, there is no structured … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
references
References 40 publications
0
0
0
Order By: Relevance