2013
DOI: 10.1016/j.jocs.2012.08.017
|View full text |Cite
|
Sign up to set email alerts
|

Efficient SIMD solution of multiple systems of stiff IVPs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 18 publications
0
7
0
Order By: Relevance
“…Such methods are generally not as stable as fully implicit RK methods; however, the reduced stability, which manifests itself in terms of smaller stable step sizes, is often more than made up for by the reduced computational expense per step [Hairer and Wanner 1996;Sandu et al 1997]. Furthermore, the implementation of linearly implicit methods is typically much easier, less subject to tuning of convergence parameters, etc., and in some situations, due to their predictable number of operations per iteration, may be more amenable to parallelization [Kroshko and Spiteri 2013]. An IMEX ARK method can be used as a linearly implicit RK method by using the Jacobian J f (t, y(t)) of f(t, y(t)) to split the RHS into linear and nonlinear parts [Cooper and Sayfy 1983] as f [1] (t, y) = J f (t, y(t)) · y,…”
Section: Methodsmentioning
confidence: 99%
“…Such methods are generally not as stable as fully implicit RK methods; however, the reduced stability, which manifests itself in terms of smaller stable step sizes, is often more than made up for by the reduced computational expense per step [Hairer and Wanner 1996;Sandu et al 1997]. Furthermore, the implementation of linearly implicit methods is typically much easier, less subject to tuning of convergence parameters, etc., and in some situations, due to their predictable number of operations per iteration, may be more amenable to parallelization [Kroshko and Spiteri 2013]. An IMEX ARK method can be used as a linearly implicit RK method by using the Jacobian J f (t, y(t)) of f(t, y(t)) to split the RHS into linear and nonlinear parts [Cooper and Sayfy 1983] as f [1] (t, y) = J f (t, y(t)) · y,…”
Section: Methodsmentioning
confidence: 99%
“…The CPU thread issues explicit SIMD instructions to enact operations across the parallel lanes of the vector units. Kroshko and Spiteri [33] demonstrated this approach in their SIMD implementation of a RODAS Rosen-brock solver. There, they reported a speed-up of 1.89 × (i.e., 94 % parallel efficiency) when solving multiple systems of stiff IVPs on a cell broadband engine.…”
Section: Parallel Integrator Implementationsmentioning
confidence: 99%
“…Thus, every thread in a warp may have similar collapse strength and similar amount of slow down during the collapse phase. Such a "clustering" technique is already suggested by [90]. The number of registers required to avoid spilling in the Keller-Miksis test case is 184.…”
Section: Test Case: Pressure Relief Valve (Impact Dynamics)mentioning
confidence: 99%