2011
DOI: 10.48550/arxiv.1106.4964
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Efficient implementation of the overlap operator on multi-GPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…Compared to the earlier implementation of the overlap operator [21], the current implementation further improves the performance of data exchange on different nodes of the cluster and uses the polynomial approximation for the overlap operator instead of the rational approximation, and has achieved better scaling and further speed up of the calculation by a factor of two on average [22].…”
Section: Numerical Detailsmentioning
confidence: 99%
“…Compared to the earlier implementation of the overlap operator [21], the current implementation further improves the performance of data exchange on different nodes of the cluster and uses the polynomial approximation for the overlap operator instead of the rational approximation, and has achieved better scaling and further speed up of the calculation by a factor of two on average [22].…”
Section: Numerical Detailsmentioning
confidence: 99%
“…Since then LQCD has been at the forefront of the adoption of GPUs in HPC. Notable publications include multi-GPU parallelization [21,22,23], the use of additive-Schwarz preconditioning to improve strong scaling [24], implementation of multi-grid solvers [25], software-managed cacheblocking strategies [26], and JIT-compilation to enable the offload of the entire underlying data-parallel framework of the Chroma [27] application without any high-level source changes [28].…”
Section: Previous Workmentioning
confidence: 99%
“…The low-energy constant ∆ mix , which measures the mismatch of the mixed valence and sea pion masses between the domain-wall fermion and the overlap fermion, is shown to be very small [13]. Since overlap fermion accommodates the multi-mass algorithm and the eigenvectors are the same for different quark masses, we use 1000 pairs of eigenvectors plus the zero modes for deflation in calculating quark propagators for several masses on 45 configurations (see Ref [14] for details). The bare mass parameters are chosen as am (val) q = 0.00170, 0.00240, 0.00300, 0.00455, 0.00600 and 0.02030, which give the pion mass ranging from 114 to 371 MeV.…”
Section: ρ Meson Massmentioning
confidence: 99%