2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (Ccgrid 2012) 2012
DOI: 10.1109/ccgrid.2012.51
|View full text |Cite
|
Sign up to set email alerts
|

Productive Parallel Linear Algebra Programming with Unstructured Topology Adaption

Abstract: Abstract-Sparse linear algebra is a key component of many scientific computations such as computational fluid dynamics, mechanical engineering or the design of new materials to mention only a few. The discretization of complex geometries in unstructured meshes leads to sparse matrices with irregular patterns. Their distribution in turn results in irregular communication patterns within parallel operations.In this paper, we show how sparse linear algebra can be implemented effortless on distributed memory archi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…Finally, each destination GPU shares its received border with the other side of its GPU-pair via NVLink (ln. [14][15][16]. Once the inter-GPU communication completes, each GPU processes its remaining part of the sub-image (i.e., sync zone) (ln.…”
Section: Summary Of the Designmentioning
confidence: 99%
“…Finally, each destination GPU shares its received border with the other side of its GPU-pair via NVLink (ln. [14][15][16]. Once the inter-GPU communication completes, each GPU processes its remaining part of the sub-image (i.e., sync zone) (ln.…”
Section: Summary Of the Designmentioning
confidence: 99%
“…The state type is represented by thrust :: device vector<double>: 1 typedef thrust::device vector<double> state type; The X, Y , and Z components of the state are held in the continuous partitions of the vector. operator() uses the standard technique of packing the state components into a zip iterator and passes the composite sequence to the thrust :: for each algorithm together with the provided device function object: 12 struct lorenz functor ; dxdt.begin(), dxdt.begin() + N, dxdt.begin() + 2 * N ) ), 21 thrust :: make zip iterator( thrust :: make tuple( R.end(), 23 x.begin() + N, x.begin() + 2 * N, x.end(), 24 dxdt.begin() + N, dxdt.begin() + 2 * N, dxdt.end() ) ), 25 lorenz functor () ); 26…”
Section: };mentioning
confidence: 99%
“…The only difference here is that values of neighboring vector elements are needed. In order to access these values, we use Thrust's permutation iterator, so that operator() of the system function object becomes thrust :: make permutation iterator( x.begin(), prev.begin() ), 6 thrust :: make permutation iterator( x.begin(), next.begin() ), 7 omega.begin() , dxdt.begin() ) ), thrust :: make permutation iterator( x.begin(), prev.end() ), 12 thrust :: make permutation iterator( x.begin(), next.end() ), 13 omega.end(), dxdt.end() ) ), 14 phase oscillators functor () 15 );…”
mentioning
confidence: 99%
“…Algebraic MultiGrid (AMG) [35,37] is a robust preconditioner for elliptic problems. It is appreciated for its extensibility qualities with M-matrix systems: the number of iterations required to converge only depends minimally on the problem size and can be entirely sizeindependent.…”
Section: Algebraic Multigridmentioning
confidence: 99%
“…The Parallel Matrix Template Library v4 (PMTL4) [10,35] provides linear algebra operations on distributed data as a C + + template library. Available data types are distributed vector and sparse and dense matrix types as well as abstractions to conveniently handle distribution and migration.…”
Section: Pmtl4mentioning
confidence: 99%