Proceedings of 34th Annual International Symposium on Lattice Field Theory — PoS(LATTICE2016) 2016
DOI: 10.22323/1.256.0270
|View full text |Cite
|
Sign up to set email alerts
|

MILC Staggered Conjugate Gradient Performance on Intel KNL

Abstract: We review our work done to optimize the staggered conjugate gradient (CG) algorithm in the MILC code for use with the Intel Knights Landing (KNL) architecture. KNL is the second generation Intel Xeon Phi processor. It is capable of massive thread parallelism, data parallelism, and high on-board memory bandwidth and is being adopted in supercomputing centers for scientific research. The CG solver consumes the majority of time in production running, so we have spent most of our effort on it. We compare performan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 5 publications
0
6
0
Order By: Relevance
“…Moreover, both works analyze traditional scientific applications while we also evaluate the data-analytics applications. Other studies [5], [6] focus on the performance of a specific application on a KNL system and show how to optimize the application to increase the performance benefit. More specifically, Bálint et al [6] shows of a performance analysis of linear solvers and kernels used in Lattice Quantumchromodynamics (QCD) on 16 KNL nodes.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Moreover, both works analyze traditional scientific applications while we also evaluate the data-analytics applications. Other studies [5], [6] focus on the performance of a specific application on a KNL system and show how to optimize the application to increase the performance benefit. More specifically, Bálint et al [6] shows of a performance analysis of linear solvers and kernels used in Lattice Quantumchromodynamics (QCD) on 16 KNL nodes.…”
Section: Related Workmentioning
confidence: 99%
“…This work also emphasizes the importance of using hyper-threads to effectively exploit HBM bandwidth. DeTar et al [5] optimize a staggered conjugate gradient solver in the MILC application for KNL and report that use of HBM brings a large improvement for the MILC application.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Overall three USQCD codes (Chroma, MILC, CPS) askwere chosen to be Tier-1 NERSC Exascale Application Partnership (NESAP) codes, with another (QLua) becoming a Tier-2 code. As part of this partnership, these codes were further developed and optimized through hackathons and dungeon sessions with NERSC and Intel [106,107]. In the context of USQCD, Jefferson Lab deployed a KNL cluster in 2016 and enlarged it in 2018.…”
Section: Utilizing Intel Xeon Phi Tm Architecturementioning
confidence: 99%
“…From a HPC viewpoint, a clear advantage of this operator with precomputed V µ is that its stencil is restricted to sites which are at most one hop away. Still, it is not trivial to reach an acceptable performance on a many-core architecture [4,5].…”
Section: Staggered Kernel Details and Performancementioning
confidence: 99%