2015 IEEE 22nd International Conference on High Performance Computing (HiPC) 2015
DOI: 10.1109/hipc.2015.9
|View full text |Cite
|
Sign up to set email alerts
|

On the Resilience of Parallel Sparse Hybrid Solvers

Abstract: As the computational power of high performance computing (HPC) systems continues to increase by using a huge number of CPU cores or specialized processing units, extreme-scale applications are increasingly prone to faults. Consequently, the HPC community has proposed many contributions to design resilient HPC applications. These contributions may be system-oriented, theoretical or numerical. In this study we consider an actual fully-featured parallel sparse hybrid (direct/iterative) linear solver, MAPHYS, and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 19 publications
0
9
0
Order By: Relevance
“…where Ω is a (nice) domain in R We consider Schwarz iterative methods (also called subspace correction methods) for solving (1). The underlying space splitting is given by a collection {V i } i=0,1,...,n of n + 1 separable real Hilbert spaces, each equipped with a spectrally equivalent scalar product a i (·, ·) and norm a i (v i , v i ) 1/2 , and bounded linear operators R i :…”
Section: Theoretical Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…where Ω is a (nice) domain in R We consider Schwarz iterative methods (also called subspace correction methods) for solving (1). The underlying space splitting is given by a collection {V i } i=0,1,...,n of n + 1 separable real Hilbert spaces, each equipped with a spectrally equivalent scalar product a i (·, ·) and norm a i (v i , v i ) 1/2 , and bounded linear operators R i :…”
Section: Theoretical Resultsmentioning
confidence: 99%
“…In the m-th step of a Schwarz iterative method for solving (1), a certain finite set I m ⊂ {0, 1, . .…”
Section: Theoretical Resultsmentioning
confidence: 99%
“…Interpolation-Restart (IR) techniques are designed to cope with node crashes (hard faults) in a parallel distributed environment (Agullo et al, 2015(Agullo et al, , 2017(Agullo et al, , 2016a(Agullo et al, , 2016b. The methods can be designed at the algebraic level for the solution both of linear systems and of eigenvalue problems.…”
Section: Interpolation-restartmentioning
confidence: 99%
“…In this paper, we extend the interpolation-restart (IR) strategies introduced for the solution of linear systems [1,21,3] to a few state-of-the-art eigensolvers. More precisely, the Arnoldi [6], Implicitly restarted Arnoldi [22], subspace iteration [30], and Jacobi-Davidson [35] algorithms are being revisited to make them resilient in the presence of faults.…”
Section: Interpolation-restart Strategies For Resilient Eigensolversmentioning
confidence: 99%