2018 IEEE 25th International Conference on High Performance Computing (HiPC) 2018
DOI: 10.1109/hipc.2018.00031
|View full text |Cite
|
Sign up to set email alerts
|

Characterization of the Impact of Soft Errors on Iterative Methods

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 41 publications
0
7
0
Order By: Relevance
“…These ephemeral data structures are usually well-tuned for DRAM-based high-performance computing (HPC) systems [4,19,45,46]. Large-scale HPC systems are likely to be interrupted by failures because its components are not reliable [21,35,50]. Consequently, checkpointing is required for long-running HPC applications designed with the ephemeral data structures to provide failure recovery.…”
Section: Introductionmentioning
confidence: 99%
“…These ephemeral data structures are usually well-tuned for DRAM-based high-performance computing (HPC) systems [4,19,45,46]. Large-scale HPC systems are likely to be interrupted by failures because its components are not reliable [21,35,50]. Consequently, checkpointing is required for long-running HPC applications designed with the ephemeral data structures to provide failure recovery.…”
Section: Introductionmentioning
confidence: 99%
“…As discussed in the previous section, the data can be leveraged for further anaylsis and can facilitate future studies. Therefore, at the end of this study, we created a publicly available database called Iterative Method Injection Collection at https://github.com/pnnl/IMIC [58]. This database has all 1.75 million injection results with traces collected during this study.…”
Section: Imic Databasementioning
confidence: 99%
“…That is, for the n th iteration, we compare the injected vector value with the vectors at the {n-20 .. n+20} iterations. We compare the injected vector with the corresponding healthy vector's values over the course of iteration window, as any change introduced by the error can hinder the convergence, but also, by chance, it can help moving the execution to the right direction [58]. We calculated ℓ1 norms between each vector and the correct execution's corresponding vector range.…”
Section: • Iteration Percentage Calculated Asmentioning
confidence: 99%
See 2 more Smart Citations