Distributed memory, GPU accelerated Fock construction for hybrid, Gaussian basis density functional theory

Williams‐Young, David B.; Asadchev, Andrey; Popovici, Doru Thom; Clark, David; Waldrop, Jonathan M.; Windus, Theresa L.; Valeev, Edward F.; Jong, Wibe A. de

doi:10.1063/5.0151070

Cited by 6 publications

(2 citation statements)

References 117 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Thus, the question remains: what is a better way to evaluate 2-electron integrals over high- l Gaussian AOs on GPUs? Our recent work on the density-fitting-accelerated J-matrix engine implementation for GPUs based on McMurchie–Davidson (MD) recurrences hinted that the MD recurrences recast in matrix form might be the way to go for some classes of integrals; that development also provided many fundamental elements that we reused in this work. We were not alone in thinking that the SHARK integral engine developed by Frank Neese and recently incorporated into the public release of ORCA program illustrated how efficient the MD scheme can be when expressed as a matrix multiplication (matmul) on conventional CPUs, at least when used for integrals over high angular momenta (SHARK supplements the MD approach with traditional Obara-Saika-based kernels implemented in the Libint library).…”

Section: Introductionmentioning

confidence: 99%

“…A possible workaround for the challenge of high-l integrals is the use of real-space factorization of 2-electron integrals, as illustrated recently by us and collaborators 24 via the use of realspace quadrature ("pseudospectral", 25 also known as chain-ofspheres 26 or seminumerical 27 ) approximation to the exact exchange, which trades the problem of computing 4-center 2electron integrals for evaluation of cheaper but much more numerous 2-center 1-electron Gaussian AO integrals. Nevertheless, these and other numerical approximations that avoid the 4-center integrals cannot entirely eliminate the need to evaluate 4-center 2-electron integrals; thus, their efficient evaluation, especially for high angular momenta, remains a critical challenge on modern HPC platforms.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

High-Performance Evaluation of High Angular Momentum 4-Center Gaussian Integrals on Modern Accelerated Processors

Asadchev,

Valeev

2023

J. Phys. Chem. A

Self Cite

View full text Add to dashboard Cite

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

High-Performance Evaluation of High Angular Momentum 4-Center Gaussian Integrals on Modern Accelerated Processors

Asadchev,

Valeev

2023

J. Phys. Chem. A

Self Cite

View full text Add to dashboard Cite

Special Topic on High Performance Computing in Chemical Physics

Straatsma,

Windus,

Nakajima

2023

The Journal of Chemical Physics

View full text Add to dashboard Cite

Computational modeling and simulation have become indispensable scientific tools in virtually all areas of chemical, biomolecular, and materials systems research. Computation can provide unique and detailed atomic level information that is difficult or impossible to obtain through analytical theories and experimental investigations. In addition, recent advances in micro-electronics have resulted in computer architectures with unprecedented computational capabilities, from the largest supercomputers to common desktop computers. Combined with the development of new computational domain science methodologies and novel programming models and techniques, this has resulted in modeling and simulation resources capable of providing results at or better than experimental chemical accuracy and for systems in increasingly realistic chemical environments.

show abstract

CMaize: Simplifying inter-package modularity from the build up

Crandall,

Windus,

Richard

2024

The Journal of Chemical Physics

View full text Add to dashboard Cite

There is a growing desire for inter-package modularity within the chemistry software community to reuse encapsulated code units across a variety of software packages. Most comprehensive efforts at achieving inter-package modularity will quickly run afoul of a very practical problem, being able to cohesively build the modules. Writing and maintaining build systems has long been an issue for many scientific software packages that rely on compiled languages such as C/C++. The push for inter-package modularity compounds this issue by additionally requiring binary artifacts from disparate developers to interoperate at a binary level. Thankfully, the de facto build tool for C/C++, CMake, is more than capable of supporting the myriad of edge cases that complicate writing robust build systems. Unfortunately, writing and maintaining a robust CMake build system can be a laborious endeavor because CMake provides few abstractions to aid the developer. The need to significantly simplify the process of writing robust CMake-based build systems, especially in inter-package builds, motivated us to write CMaize. In addition to describing the architecture and design of CMaize, the article also demonstrates how CMaize is used in production-level software.

show abstract

Distributed memory, GPU accelerated Fock construction for hybrid, Gaussian basis density functional theory

Cited by 6 publications

References 117 publications

High-Performance Evaluation of High Angular Momentum 4-Center Gaussian Integrals on Modern Accelerated Processors

High-Performance Evaluation of High Angular Momentum 4-Center Gaussian Integrals on Modern Accelerated Processors

Special Topic on High Performance Computing in Chemical Physics

CMaize: Simplifying inter-package modularity from the build up

Contact Info

Product

Resources

About