1997
DOI: 10.1137/1.9780898719642

ScaLAPACK Users' Guide

Cited by 1,138 publications (1,012 citation statements)
References: 0 publications
“…The BLAS, LAPACK, and ScaLAPACK [5] projects place backward compatibility as a high priority, which hinders progress towards adopting modern software engineering principles such as object abstraction. libflame is built around opaque structures that hide implementation details of matrices, such as data layout, and exports object-based programming interfaces to operate upon these structures [4].…”
Section: What Is Different (mentioning)
confidence: 99%
“…A blockcyclic distribution generalizes blocked and cyclic distributions, by distributing contiguous blocks of any size b cyclically among processors. Cyclic decompositions are commonly used in parallel numerical linear algebra algorithms and frameworks such as ScaLAPACK (block-cyclic) [7] and Elemental (cyclic) [40]. Our method extends this decomposition to tensors.…”
Section: Cyclic Tensor Blocking (mentioning)
confidence: 99%
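
The block-cyclic distribution described in the statement above can be illustrated with a small one-dimensional sketch. This is not ScaLAPACK code; the function name `block_cyclic_owner` and its parameters are hypothetical, and indices are assumed to be 0-based. With a block size of 1 the mapping reduces to a purely cyclic distribution, and with a block size of ceil(n / P) it reduces to a blocked one.

```python
def block_cyclic_owner(global_index, block_size, num_procs):
    """Map a 0-based global index to (owner process, local index).

    Hypothetical helper for illustration: contiguous blocks of
    `block_size` entries are dealt out to processes in round-robin order.
    """
    block = global_index // block_size          # which block the entry is in
    owner = block % num_procs                   # blocks are assigned cyclically
    local_block = block // num_procs            # earlier blocks held by this owner
    offset = global_index % block_size          # position inside the block
    return owner, local_block * block_size + offset


if __name__ == "__main__":
    # Ten entries, blocks of size 2, three processes.
    print([block_cyclic_owner(i, 2, 3)[0] for i in range(10)])
```

The same mapping applied independently to rows and columns gives a two-dimensional block-cyclic layout over a process grid.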
“…There are four basic steps involved in calling a ScaLAPACK routine. The reader is directed to the ScaLAPACK users' guide [15] for more details.…”
Section: Inputs (mentioning)
confidence: 99%
“…It is observed that for a process arrangement, execution times are the same no matter what algorithmic blocking factor is used. However to ensure efficient data distribution, ScaLAPACK [15] recommends that any blocking factor between 32 to 64 be used to distribute the rows and the columns of the matrices involved in the computation of the linear algebra kernel. The tool uses a value of 64; 3.…”
Section: Porting Issues (mentioning)
confidence: 99%
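
To make the effect of the blocking factor concrete, the sketch below counts how many rows (or columns) each process holds under a one-dimensional block-cyclic distribution. The helper `local_count` is hypothetical; it follows the same counting idea as ScaLAPACK's NUMROC tool routine but assumes the first block lives on process 0. The matrix dimension of 1000 and the four processes are illustrative assumptions, while the block size of 64 is the value quoted in the statement above.

```python
def local_count(n, block_size, proc, num_procs):
    """Number of the n global entries stored on process `proc`.

    Hypothetical helper: assumes 0-based process ids and that the
    first block is owned by process 0.
    """
    full_blocks = n // block_size                       # complete blocks overall
    count = (full_blocks // num_procs) * block_size     # blocks every process gets
    extra_blocks = full_blocks % num_procs              # leftover complete blocks
    if proc < extra_blocks:
        count += block_size                             # one more complete block
    elif proc == extra_blocks:
        count += n % block_size                         # the trailing partial block
    return count


if __name__ == "__main__":
    # Assumed example: 1000 rows, block size 64, 4 processes.
    counts = [local_count(1000, 64, p, 4) for p in range(4)]
    print(counts, sum(counts))
```

The per-process counts always sum to the global dimension, which is a quick sanity check when choosing a block size.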