Auto-tuning the Matrix Powers Kernel with SEJITS

Morlan, Jeffrey; Kamil, Shoaib; Fox, Armando

doi:10.1007/978-3-642-38718-0_36

Lecture Notes in Computer Science

2013

DOI: 10.1007/978-3-642-38718-0_36

|View full text |Cite

Auto-tuning the Matrix Powers Kernel with SEJITS

Jeffrey Morlan

Shoaib Kamil

Armando Fox

Abstract: Abstract. The matrix powers kernel, used in communication-avoiding Krylov subspace methods, requires runtime auto-tuning for best performance. We demonstrate how the SEJITS (Selective Embedded Just-InTime Specialization) approach can be used to deliver a high-performance and performance-portable implementation of the matrix powers kernel to application authors, while separating their high-level concerns from those of auto-tuner implementers involving low-level optimizations. The benefits of delivering this ker… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2023

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 8 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Therefore, those schemes require either redundant computations (explicit schemes) and/or irregular accesses to the matrix entries with bookkeeping (implicit schemes), resulting in performance bottlenecks. In [17] a runtime auto-tuning was introduced for the MPK scheme described above to choose the appropriate parameters (e.g., explicit vs. implicit schemes) for a given matrix. This was generalized to various kernels like Jacobi and serial Gauss-Seidel iterative solvers and automated using a sparse tiling algorithm via the power of loop chain abstraction [18], [19].…”

mentioning

confidence: 99%

Level-Based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication

Alappat¹,

Hager²,

Schenk

et al. 2023

IEEE Trans. Parallel Distrib. Syst.

View full text Add to dashboard Cite

The multiplication of a sparse matrix with a dense vector (SpMV) is a key component in many numerical schemes and its performance is known to be severely limited by main memory access. Several numerical schemes require the multiplication of a sparse matrix polynomial with a dense vector which is typically implemented as a sequence of SpMVs. This results in low performance and ignores the potential to increase the arithmetic intensity by reusing the matrix data from cache. In this work we use the recursive algebraic coloring engine (RACE) to enable blocking of sparse matrix data across the polynomial computations. In the graph representing the sparse matrix we form levels using a breadth-first search. Locality relations of these levels are then used to improve spatial and temporal locality when accessing the matrix data and to implement an efficient multithreaded parallelization. Our approach is independent of the matrix structure and avoids shortcomings of existing "blocking" strategies in terms of hardware efficiency and parallelization overhead. We quantify the quality of our implementation using performance modelling and demonstrate speedups of up to 3× and 5× compared to an optimal SpMV-based baseline on a single multicore chip of recent Intel and AMD architectures. Various numerical schemes like s-step Krylov solvers, polynomial preconditioners and power clustering algorithms will benefit from our development.

show abstract

mentioning

confidence: 99%

Level-Based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication

Alappat¹,

Hager²,

Schenk

et al. 2023

IEEE Trans. Parallel Distrib. Syst.

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Auto-tuning the Matrix Powers Kernel with SEJITS

Cited by 1 publication

References 8 publications

Level-Based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication

Level-Based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication

Contact Info

Product

Resources

About