2018
DOI: 10.1021/acs.jctc.8b00342
|View full text |Cite
|
Sign up to set email alerts
|

Phaseless Auxiliary-Field Quantum Monte Carlo on Graphical Processing Units

Abstract: We present an implementation of phaseless Auxiliary-Field Quantum Monte Carlo (ph-AFQMC) utilizing graphical processing units (GPUs). The AFQMC method is recast in terms of matrix operations which are spread across thousands of processing cores and are executed in batches using custom Compute Unified Device Architecture kernels and the GPU-optimized cuBLAS matrix library. Algorithmic advances include a batched Sherman-Morrison-Woodbury algorithm to quickly update matrix determinants and inverses, density-fitti… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
103
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
9

Relationship

6
3

Authors

Journals

citations
Cited by 52 publications
(105 citation statements)
references
References 110 publications
2
103
0
Order By: Relevance
“…Recently we have introduced a correlated sampling (CS) approach for quantities involving energy differences which is capable of reducing computational prefactors 46 and in some cases the severity of the phaseless approximation. 43 In this section, we show that significant reductions in statistical errors are obtained not only for hydrogen abstraction reactions, as shown previously, but also for bond breaking events between a transition metal and a heavier ligand atom.…”
Section: Correlated Sampling For Bdessupporting
confidence: 77%
See 1 more Smart Citation
“…Recently we have introduced a correlated sampling (CS) approach for quantities involving energy differences which is capable of reducing computational prefactors 46 and in some cases the severity of the phaseless approximation. 43 In this section, we show that significant reductions in statistical errors are obtained not only for hydrogen abstraction reactions, as shown previously, but also for bond breaking events between a transition metal and a heavier ligand atom.…”
Section: Correlated Sampling For Bdessupporting
confidence: 77%
“…46 The second advance is the development of an efficient implementation of ph-AFQMC on graphical processing units (GPUs), including the use of the Sherman-Morrison-Woodbury (SMW) algorithm to accelerate calculations using multideterminental trial wavefunctions. 43 For problems where correlated sampling is applicable, the combination of these two techniques can reduce the computational effort by more than two orders of magnitude, enabling the method to be applied to larger systems, and also to substantially larger data sets. Further efficiency improvements are feasible (reducing both the scaling and the prefactor), leading to the possibility that ph-AFQMC will emerge as a scalable benchmark methodology for transition metal-containing systems.…”
Section: Introductionmentioning
confidence: 99%
“…64 ST gaps are extrapolated to the complete basis set (CBS) limit using exponential and 1/x 3 forms for the mean-field and correlation energies, respectively, as detailed in, e.g., Ref. 61.…”
Section: Restricted Hf (Uhf) Restricted Hf (Rhf) and Its Open-shell mentioning
confidence: 99%
“…We also find that the memory requirements using this approach are significantly reduced from the conventional AFQMC algorithm using Cholesky decomposition alone. The algorithmic advances may also be used in conjunction with parallel efforts to accelerate AFQMC through improved hardware implementations [54]. While more work is necessary to establish the relative benefits in AFQMC of exposing sparsity through low rank, as in the current work, versus the direct utilization of sparse operations, we expect such combinations to greatly advance the practical possibilities for AFQMC calculations on large systems.…”
Section: Discussionmentioning
confidence: 89%