Proceedings of the HPC Asia 2023 Workshops 2023
DOI: 10.1145/3581576.3581610
|View full text |Cite
|
Sign up to set email alerts
|

Wilson matrix kernel for lattice QCD on A64FX architecture

Abstract: We study the implementation of the even-odd Wilson fermion matrix for lattice QCD simulations on the A64FX architecture. Efficient coding of the stencil operation is investigated for two-dimensional packing to SIMD vectors. We measure the sustained performance on the supercomputer Fugaku at RIKEN R-CCS and show the profiler result of our code, which may signal an unexpected source of slow-down in addition to the detailed efficiency of each part of the code.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 12 publications
0
1
0
Order By: Relevance
“…This performance drop of the even-odd operator has disappeared after the post-conference tuning[20].…”
mentioning
confidence: 97%
“…This performance drop of the even-odd operator has disappeared after the post-conference tuning[20].…”
mentioning
confidence: 97%