2019
DOI: 10.2478/amcs-2019-0030

The Parallel Tiled WZ Factorization Algorithm for Multicore Architectures

Abstract: The aim of this paper is to investigate dense linear algebra algorithms on shared-memory multicore architectures. The design and implementation of a parallel tiled WZ factorization algorithm that can fully exploit such architectures are presented. Three parallel implementations of the algorithm are studied. The first relies only on exploiting multithreaded BLAS (basic linear algebra subprograms) operations. The second implementation, in addition to BLAS operations, employs the OpenMP standard to use the loop-…

Cited by 3 publications (5 citation statements). References 20 publications (28 reference statements).
“…QIF is known for the adaptability of its direct method for solving systems of linear equations. The factorization gives rise to parallel implicit elimination (PIE) for the solution of linear systems, which computes two matrix elements simultaneously (two columns at a time) for parallel implementation, unlike Gaussian elimination (GE), which computes one column at a time [13]. The stability of QIF comes from the matrix being centro-nonsingular (its central submatrices are nonsingular), which is far more reliable than any other type of factorization [8].…”
Section: Quadrant Interlocking Factorization
confidence: 99%
“…While LU factorization performs elimination serially in n − 1 steps, WZ factorization executes its components in parallel in n/2 steps if n is even, or (n − 1)/2 steps if n is odd. WZ factorization computes two matrix elements simultaneously (two columns at a time), unlike LU factorization, which computes one column at a time [12,13,36]. Unlike the WZ factorization, the LU factorization is not unique, but block LU factorization with higher-order diagonal blocks gives analytic results similar to those of WZ factorization [37].…”
Section: WZ Factorization Versus LU Factorization
confidence: 99%
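The two-columns-at-a-time elimination described above can be sketched in plain Python. This is a hypothetical illustration of the basic (untiled) WZ scheme for an even-order matrix, not the paper's parallel tiled implementation: each step k uses pivot rows k and n−1−k together, so only n/2 elimination steps are needed, versus n−1 for LU.

```python
def wz_factorize(A):
    """Sketch of WZ (quadrant interlocking) factorization: A = W * Z.

    Step k uses pivot rows k and n-1-k to zero columns k and n-1-k
    of every row strictly between them, so an even-order matrix is
    reduced in n/2 steps. Requires the matrix to be centro-nonsingular
    (each central 2x2 pivot block nonsingular); no pivoting is done.
    """
    n = len(A)
    assert n % 2 == 0, "sketch handles even n only"
    Z = [row[:] for row in A]  # working copy, becomes the Z-matrix
    W = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for k in range(n // 2):
        k2 = n - 1 - k
        # 2x2 pivot block taken from rows k and k2
        a, b = Z[k][k], Z[k2][k]
        c, d = Z[k][k2], Z[k2][k2]
        det = a * d - b * c
        for i in range(k + 1, k2):
            # Cramer's rule: multipliers zeroing Z[i][k] and Z[i][k2]
            w1 = (Z[i][k] * d - Z[i][k2] * b) / det
            w2 = (Z[i][k2] * a - Z[i][k] * c) / det
            W[i][k], W[i][k2] = w1, w2
            for j in range(n):
                Z[i][j] -= w1 * Z[k][j] + w2 * Z[k2][j]
    return W, Z
```

The inner loop over rows i is where the parallelism the excerpt mentions lives: every row update at step k is independent, so the updates can be distributed across threads (or, as in the paper, grouped into tiles handled by BLAS calls).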
“…Therefore, the methods described in these references lack generality [31, 38–41]. As reported in [32, 42–44], relevant features, such as the number of loop iterations of multilevel nested loops and the number of nesting levels, were extracted from the compiler's intermediate representation to construct the loop-selection assessment model for multilevel nested loops. Compared with models based on the thread-level speculation technique, these models improve the parallelization of multilevel nested loops to some extent.…”
Section: Related Work
confidence: 99%
“…Compared with models based on the thread-level speculation technique, these models improve the parallelization of multilevel nested loops to some extent. However, as the number of nested loop levels in a program increases, the iteration dependencies between loops become complicated, which still prevents these models from achieving the desired performance improvement [32, 42–44]. The authors in [45–47] proposed frameworks for handling misspeculation based on loop cost in the compiler.…”
Section: Related Work
confidence: 99%
“…LU factorization is well known to be implemented in the LAPACK library to exploit standard software library architectures [17]. WZ factorization offers parallelization in solving both sparse and dense linear systems, enhancing performance using OpenMP, CUDA, BLAS, or EDK HW/SW co-design architectures [1,14]. Yalamov [42] showed that WZ factorization is faster on a computer with a parallel architecture than other matrix factorization methods.…”
Section: Introduction
confidence: 99%