2003
DOI: 10.1155/2003/278167

OpenMP Issues Arising in the Development of Parallel BLAS and LAPACK Libraries

Abstract: Dense linear algebra libraries need to cope efficiently with a range of input problem sizes and shapes. Inherently this means that parallel implementations have to exploit parallelism wherever it is present. While OpenMP allows relatively fine grain parallelism to be exploited in a shared memory environment it currently lacks features to make it easy to partition computation over multiple array indices or to overlap sequential and parallel computations. The inherent flexible nature of shared memory p…
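To make the abstract's remark about partitioning computation over multiple array indices concrete, here is a minimal sketch in C. It is not taken from the paper: the function name `tiled_gemm`, the `tile` parameter, and the row-major layout are assumptions chosen for illustration. The idea shown is one common workaround, flattening a two-dimensional grid of output tiles into a single loop index so that one OpenMP worksharing loop distributes both the row and the column dimension. Later OpenMP versions (3.0 onward) added a `collapse` clause that covers perfectly nested loops more directly.

```c
#include <stddef.h>

/* Hypothetical helper (not from the paper):
 * C (m x n) += A (m x k) * B (k x n), all matrices row-major.
 * The (row-tile, column-tile) pair is flattened into one index t,
 * so a single worksharing loop partitions work over both indices.
 * Assumes tile > 0. Without OpenMP enabled, the pragma is ignored
 * and the loop simply runs sequentially. */
void tiled_gemm(int m, int n, int k,
                const double *A, const double *B, double *C,
                int tile)
{
    int row_tiles = (m + tile - 1) / tile;  /* ceiling division */
    int col_tiles = (n + tile - 1) / tile;
    int t;

    #pragma omp parallel for schedule(static)
    for (t = 0; t < row_tiles * col_tiles; ++t) {
        /* Recover the 2D tile coordinates from the flat index. */
        int i0 = (t / col_tiles) * tile;
        int j0 = (t % col_tiles) * tile;
        int i1 = (i0 + tile < m) ? i0 + tile : m;
        int j1 = (j0 + tile < n) ? j0 + tile : n;

        /* Each tile of C is written by exactly one iteration,
         * so no synchronization is needed between threads. */
        for (int i = i0; i < i1; ++i) {
            for (int j = j0; j < j1; ++j) {
                double sum = 0.0;
                for (int p = 0; p < k; ++p)
                    sum += A[(size_t)i * k + p] * B[(size_t)p * n + j];
                C[(size_t)i * n + j] += sum;
            }
        }
    }
}
```

With this layout a short, wide matrix and a tall, narrow one both yield roughly `row_tiles * col_tiles` independent units of work, which is the kind of shape-independence the abstract says dense linear algebra libraries need. A production BLAS would call an optimized kernel per tile rather than the naive inner loops used here.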

Cited by 11 publications (19 citation statements)
References 11 publications
“…• The pipelined algorithm from [1] does not perform nearly as well as the data-flow algorithms proposed in this paper.…”
Section: Results (mentioning)
confidence: 78%
“…The problem with this idea is that it greatly complicates the code that implements the algorithm [1,20,24]. While doable for a single, relatively simple algorithm like the Cholesky factorization, reimplementation of an entirely library like LAPACK becomes a daunting task when these techniques are employed.…”
Section: Multithreaded BLAS (mentioning)
confidence: 99%
“…The problem with this idea is that it greatly complicates the code that implements the algorithm if coded in a traditional style [Addison et al 2003;Kurzak and Dongarra 2006;Strazdins 2001]. While feasible for a single, relatively simple algorithm like the LU factorization without pivoting or the Cholesky factorization, re-implementing a linear algebra library like LAPACK would become a daunting task if this strategy were employed.…”
Section: The Trouble With Evolving Legacy Code To Multithreaded Archi… (mentioning)
confidence: 99%
“…Jay Hoeflinger, Prasad Allavilli, Thomas Jackson and Bob Kuhn have studied scalability issues using OpenMP for CFD applications [9]. OpenMP issues in the development of parallel BLAS and LAPACK libraries have also been studied [2]. However, the issues, challenges related with programming and effective exploitation of shared memory multi-core systems with respect to cache parameters have not been considered.…”
Section: Related Work (mentioning)
confidence: 99%