2014
DOI: 10.1016/j.compfluid.2013.10.035
|View full text |Cite
|
Sign up to set email alerts
|

MPI-CUDA sparse matrix–vector multiplication for the conjugate gradient method with an approximate inverse preconditioner

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0
2

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 36 publications
(23 citation statements)
references
References 18 publications
0
21
0
2
Order By: Relevance
“…The application of the technology and its component approaches to simulating turbulent flows is exemplified in [24], where compressible flows, a combined finite volume/finite difference method, unstructured grids, and a vertex centered scheme were consid ered; in [10,6], where compressible flows, the finite volume method, unstructured hybrid grids, polyno mial reconstruction, and a cell centered scheme were used; in [14], where incompressible flows, the finite volume method, and structured staggered grids were employed; and in [25], where incompressible flows, the finite volume method, and unstructured grids were applied. These algorithms are intended for large scale computations of turbulent flows employing tens of thousands of processor cores and multiple massively parallel accelerators.…”
Section: Resultsmentioning
confidence: 99%
“…The application of the technology and its component approaches to simulating turbulent flows is exemplified in [24], where compressible flows, a combined finite volume/finite difference method, unstructured grids, and a vertex centered scheme were consid ered; in [10,6], where compressible flows, the finite volume method, unstructured hybrid grids, polyno mial reconstruction, and a cell centered scheme were used; in [14], where incompressible flows, the finite volume method, and structured staggered grids were employed; and in [25], where incompressible flows, the finite volume method, and unstructured grids were applied. These algorithms are intended for large scale computations of turbulent flows employing tens of thousands of processor cores and multiple massively parallel accelerators.…”
Section: Resultsmentioning
confidence: 99%
“…Both preconditioners can be represented as a SpMV at the solution stage . Further details can be found in [22]. In conclusion, we express the full time integration step in terms of three algebraic kernels: the SpMV, the linear combination of two vectors (AXPY) and the dot product.…”
Section: Governing Equations and Numerical Methodsmentioning
confidence: 99%
“…Several works [11]- [13] have presented approaches for optimising matrix multiplications through the usage of GPUs. Both [11], [12] focused on the overlapping between computation and data communication in order to minimise the communication and transfer times. Dang et.…”
Section: Related Workmentioning
confidence: 99%
“…The matrix multiplication implementation considered a bi-directional data transfer. Oyarzun et al [12] proposed an approach for matrix multiplication in the context of the conjugate gradient method in multi-GPU environments. In particular, the proposed solver was tested for the Poisson equation and reported improvements of up to a 200% regarding the CPU-only solver.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation