An Accelerated Recursive Doubling Algorithm for Block Tridiagonal Systems

Seal, Sudip K.

doi:10.1109/ipdps.2014.107

Cited by 3 publications (9 citation statements)

References 14 publications (26 reference statements)


“…The original formulation of E_i as described in [2] given in the theorem below, which also states the equivalence of the formulations described in this work and [2].…”

Section: VI (mentioning)

confidence: 93%

“…The Accelerated Recursive Doubling Algorithm presented in [2] accelerates the computation of solving for x for multiple right-hand sides b by separating the computation dependent on b (right-hand side dependent) and the computation independent of the right-hand side. In order to do this, B_i is separated into two matrices, one right-hand side independent (C_i), and one right-hand side dependent (F_i).…”

Section: I and F I Matrices (mentioning)

confidence: 99%

“…C_i and F_i matrices have the following properties that help simplify expressions involving their products, as presented in [2].…”

Section: I and F I Matrices (mentioning)

confidence: 99%

“…Using this formulation, we can start from E_1 and determine all subsequent E_i using the recursive formulation. This formulation of E_i is simpler but equivalent to the original formulation in [2].…”

Section: E I Matrices (mentioning)

confidence: 99%

“…Note: A method suggested in [2] to make ARDA numerically stable for the same classes of matrices for which Cyclic Reduction is stable is to take the LU-decomposition of A, and then solve Ax = LUx = b in two steps: First solving Ly = b, and then solve Ux = y.…”

Section: VI (mentioning)

confidence: 99%
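
The two-step solve described in the note above (factor A = LU once, then solve Ly = b and Ux = y for each right-hand side) can be sketched for the simplest case. This is a minimal illustration, not ARDA itself and not the implementation from [2]: it assumes a scalar (non-block) tridiagonal matrix, uses no pivoting, and the function names `factor_tridiagonal` and `solve_factored` are hypothetical. It only shows how the factorization is independent of the right-hand side while the two triangular solves depend on it.

```python
def factor_tridiagonal(lower, diag, upper):
    """Right-hand-side independent step: factor A = L U without pivoting.

    lower[i] is A[i][i-1] (lower[0] is unused), diag[i] is A[i][i],
    upper[i] is A[i][i+1] (upper[-1] is unused).
    Returns (l, u): the sub-diagonal of the unit lower bidiagonal L and
    the diagonal of the upper bidiagonal U (its super-diagonal is `upper`).
    """
    n = len(diag)
    l = [0.0] * n
    u = [0.0] * n
    u[0] = diag[0]
    for i in range(1, n):
        l[i] = lower[i] / u[i - 1]
        u[i] = diag[i] - l[i] * upper[i - 1]
    return l, u


def solve_factored(l, u, upper, b):
    """Right-hand-side dependent step: solve L y = b, then U x = y."""
    n = len(b)
    # Forward substitution with the unit lower bidiagonal L.
    y = [0.0] * n
    y[0] = b[0]
    for i in range(1, n):
        y[i] = b[i] - l[i] * y[i - 1]
    # Back substitution with the upper bidiagonal U.
    x = [0.0] * n
    x[n - 1] = y[n - 1] / u[n - 1]
    for i in range(n - 2, -1, -1):
        x[i] = (y[i] - upper[i] * x[i + 1]) / u[i]
    return x


def tridiagonal_matvec(lower, diag, upper, x):
    """Compute A x, used here only to check residuals."""
    n = len(x)
    out = []
    for i in range(n):
        value = diag[i] * x[i]
        if i > 0:
            value += lower[i] * x[i - 1]
        if i < n - 1:
            value += upper[i] * x[i + 1]
        out.append(value)
    return out


if __name__ == "__main__":
    # Example matrix: 2 on the diagonal, -1 on both off-diagonals.
    lower = [0.0, -1.0, -1.0, -1.0]
    diag = [2.0, 2.0, 2.0, 2.0]
    upper = [-1.0, -1.0, -1.0, 0.0]
    l, u = factor_tridiagonal(lower, diag, upper)  # done once
    for b in ([1.0, 0.0, 0.0, 1.0], [1.0, 2.0, 3.0, 4.0]):
        x = solve_factored(l, u, upper, b)  # repeated per right-hand side
        residual = [r - bi for r, bi in zip(tridiagonal_matvec(lower, diag, upper, x), b)]
        print(x, max(abs(r) for r in residual))
```

The factorization is computed once and reused for every right-hand side, which mirrors, in a much simpler setting, the split between right-hand side independent and dependent work that the citation statements describe.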


Optimizing the Accelerated Recursive Doubling Algorithm for Block Tridiagonal Systems of Equations

Joshipura, Seal (2020)


The need to solve block tridiagonal systems with hundreds or thousands of right-hand sides for the same block tridiagonal matrix is common in a variety of disciplines. To meet this need, the Accelerated Recursive Doubling Algorithm was developed. After a right-hand side independent phase, the algorithm allows for the quick, online calculation of solutions for different right-hand sides. In this work, we present methods to optimize the Accelerated Recursive Doubling Algorithm in memory usage and computation time in a hybrid parallelization model. The right-hand side independent phase of the naïve implementation takes ≥ 11/3 the amount of memory required to store the tridiagonal matrix, while our implementation reduces the fraction to ≈ 5/3. The right-hand side dependent phase of the naïve implementation takes ≥ 6 times the amount of memory required to store the right-hand side, while our implementation reduces the fraction to ≈ 3. The computation time for the independent phase is reduced to ≈ 2/3 times that of the naïve implementation, while the computation time for the dependent phase is reduced to ≈ 5/9. With increasing numbers of shared-memory threads q on every distributed processing element, we have O(q) theoretical speedup.
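
As a rough worked example of the memory ratios in the abstract, the figures can be translated into entry counts. The symbols here are assumptions for illustration and do not appear in the abstract: N block rows, blocks of size M × M, and R right-hand sides. The matrix is counted as three blocks per block row (ignoring the two absent corner blocks), and the abstract's ratios are read as the fractions 11/3 and 5/3.

```latex
\begin{align*}
\text{matrix storage} &\approx 3NM^2 \\
\text{naive, independent phase} &\ge \tfrac{11}{3}\cdot 3NM^2 = 11NM^2 \\
\text{optimized, independent phase} &\approx \tfrac{5}{3}\cdot 3NM^2 = 5NM^2 \\
\text{right-hand side storage} &= NMR \\
\text{naive, dependent phase} &\ge 6NMR \\
\text{optimized, dependent phase} &\approx 3NMR
\end{align*}
```

Under these assumptions, the stated reduction corresponds to going from the equivalent of about eleven blocks per block row to about five in the independent phase.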
