Multinode Multi-GPU Two-Electron Integrals: Code Generation Using the Regent Language

Johnson, Kate; Mirchandaney, Seema; Hoag, Ellis; Heirich, Alan; Aiken, Alex; Martínez, Todd J.

doi:10.1021/acs.jctc.2c00414

Cited by 11 publications

(7 citation statements)

References 86 publications

(158 reference statements)

“…While it would be possible to leverage community software in the implementation of many of these kernels, it is often the case that the development of highly specialized kernels and algorithms leads to significant performance improvements on modern hardware. For example, the development of highly optimized recursions and chemistry-specific quadrature schemes for operator integral evaluation and contraction are known to outperform generic numerical integral machinery on both CPU and accelerator architectures. In addition, domain-driven tensor frameworks typically outperform generic tensor frameworks for computational chemistry workloads.…”

Section: Programming Models and Software Integration (mentioning)

confidence: 99%

A Perspective on Sustainable Computational Chemistry Software Development and Integration

Di Felice, Mayes, Richard et al. 2023

J. Chem. Theory Comput.

The power of quantum chemistry to predict the ground and excited state properties of complex chemical systems has driven the development of computational quantum chemistry software, integrating advances in theory, applied mathematics, and computer science. The emergence of new computational paradigms associated with exascale technologies also poses significant challenges that require a flexible forward strategy to take full advantage of existing and forthcoming computational resources. In this context, the sustainability and interoperability of computational chemistry software development are among the most pressing issues. In this perspective, we discuss software infrastructure needs and investments with an eye to fully utilize exascale resources and provide unique computational tools for next-generation science problems and scientific discoveries.

“…Typically, computing QM forces takes more than 95% of the total QM/MM time. In the recent past, various efforts have been undertaken to develop computationally affordable novel QM methods or reimplement traditional QM methods to harness the power of massively parallel central processing unit (CPU) and graphics processing unit (GPU) hardware platforms. Most notably, a number of leading quantum chemistry software packages have been empowered with GPU acceleration allowing users to achieve unprecedented simulation speeds and model larger molecular systems efficiently. For instance, our own GPU-accelerated QUICK ab initio quantum chemistry and density functional theory package is highly efficient on NVIDIA hardware. QM/MM simulations with QUICK/AMBER have displayed respectable speedups of up to 53 times for a single GPU with respect to a CPU core for a moderate-sized QM region size that was benchmarked at the time.…”

Section: Introduction (mentioning)

confidence: 99%

Quantum Mechanics/Molecular Mechanics Simulations on NVIDIA and AMD Graphics Processing Units

Manathunga, Aktulga, Goetz et al. 2023

J. Chem. Inf. Model.

We have ported and optimized the graphics processing unit (GPU)-accelerated QUICK and AMBER-based ab initio quantum mechanics/molecular mechanics (QM/MM) implementation on AMD GPUs. This encompasses the entire Fock matrix build and force calculation in QUICK including one-electron integrals, two-electron repulsion integrals, exchange-correlation quadrature, and linear algebra operations. General performance improvements to the QUICK GPU code are also presented. Benchmarks carried out on NVIDIA V100 and AMD MI100 cards display similar performance on both hardware for standalone HF/DFT calculations with QUICK and QM/MM molecular dynamics simulations with QUICK/AMBER. Furthermore, with respect to the QUICK/AMBER release version 21, significant speedups are observed for QM/MM molecular dynamics simulations. This significantly increases the range of scientific problems that can be addressed with open-source QM/MM software on state-of-the-art computer hardware.

“…All of these factors make it especially challenging to port Gaussian integral kernels onto accelerated coprocessors, such as general-purpose graphical processing units (GPGPUs, or, simply, GPUs), that have become the norm both on the commodity and high-end platforms. Hence there has been an intense effort to address these challenges, both on the modern central processing units (CPUs) with wide single-instruction-multiple-data (SIMD) instructions and on GPUs.…”

Section: Introduction (mentioning)

confidence: 99%

Memory-Efficient Recursive Evaluation of 3-Center Gaussian Integrals

Asadchev, Valeev 2023

J. Chem. Theory Comput.

To improve the efficiency of Gaussian integral evaluation on modern accelerated architectures, FLOP-efficient Obara-Saika-based recursive evaluation schemes are optimized for the memory footprint. For the 3-center 2-particle integrals that are key for the evaluation of Coulomb and other 2-particle interactions in the density-fitting approximation, the use of multiquantal recurrences (in which multiple quanta are created or transferred at once) is shown to produce significant memory savings. Other innovations include leveraging register memory for reduced memory footprint and direct compile-time generation of optimized kernels (instead of custom code generation) with compile-time features of modern C++/CUDA. Performance of conventional and CUDA-based implementations of the proposed schemes is illustrated for both the individual batches of integrals involving up to Gaussians with low and high angular momenta (up to L = 6) and contraction degrees, as well as for the density-fitting-based evaluation of the Coulomb potential. The computer implementation is available in the open-source library.
