2016
DOI: 10.1016/j.parco.2015.10.003
|View full text |Cite
|
Sign up to set email alerts
|

Recursion based parallelization of exact dense linear algebra routines for Gaussian elimination

Abstract: We present block algorithms and their implementation for the parallelization of sub-cubic Gaussian elimination on shared memory architectures. Contrarily to the classical cubic algorithms in parallel numerical linear algebra, we focus here on recursive algorithms and coarse grain parallelization. Indeed, sub-cubic matrix arithmetic can only be achieved through recursive algorithms making coarse grain block algorithms perform more efficiently than fine grain ones. This work is motivated by the design and implem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
10
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 9 publications
(11 citation statements)
references
References 26 publications
1
10
0
Order By: Relevance
“…The issues concerning parallel numerical algorithms and particularly Gaussian elimination on multicore architectures were presented, among others, by Dumas et al (2016) and Buttari et al (2009). Dumas et al (2016) present investigations into the parallelization of sub-cubic Gaussian elimination on shared memory multicore architectures. They focus on the parallelization of three sub-routines, namely, matrix multiplication, triangular system solving and the PLUQ factorization.…”
Section: Related Workmentioning
confidence: 99%
“…The issues concerning parallel numerical algorithms and particularly Gaussian elimination on multicore architectures were presented, among others, by Dumas et al (2016) and Buttari et al (2009). Dumas et al (2016) present investigations into the parallelization of sub-cubic Gaussian elimination on shared memory multicore architectures. They focus on the parallelization of three sub-routines, namely, matrix multiplication, triangular system solving and the PLUQ factorization.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, several groups have been working on the efficient parallel linear algebra libraries, particularly the Gaussian elimination. The Gaussian elimination on multicore and manycore architectures was studied, among others, in works [9], [2], [6] and [8]. In the work [9] the authors investigated the parallelization of sub-cubic Gaussian elimination.…”
Section: Factorisationmentioning
confidence: 99%
“…The Gaussian elimination on multicore and manycore architectures was studied, among others, in works [9], [2], [6] and [8]. In the work [9] the authors investigated the parallelization of sub-cubic Gaussian elimination. They focused on the parallelization of three subroutines, namely, the matrix multiplication, the triangular equation solver and the LU factorisation with pivoting.…”
Section: Factorisationmentioning
confidence: 99%
“…This reduces the data locality of the algorithm, and therefore penalizes the efficiency of implementations in practice. In parallel, this blocking also puts more constrains on the dependencies between tasks Dumas et al (2015a).…”
Section: Introductionmentioning
confidence: 99%