Abstract:Electronic structure calculations have the potential to predict key matter transformations for applications of strategic technological importance, from drug discovery to material science and catalysis. However, a predictive physicochemical characterization of these processes often requires accurate quantum chemical modeling of complex molecular systems with hundreds to thousands of atoms. Due to the computationally demanding nature of electronic structure calculations and the complexity of modern high-performa… Show more
“…The present work builds on the existing RI-MP2 energy algorithm in EXESS detailed in a previous publication . The MO coefficients and eigenvalues are taken from the HF stage and a similar approach is used to calculate the B ia P and J PQ intermediates.…”
Section: Algorithm and Implementationmentioning
confidence: 99%
“…The EXESS software utilizes distributed MPI parallelism with one MPI process per GPU along with an additional coordinating process for dynamic work distribution . The majority of the intermediate values are stored in a MPI shared memory window to avoid duplication across ranks and reduce communication costs …”
Section: Algorithm and Implementationmentioning
confidence: 99%
“…In this Article, we present a novel high-performance algorithm and software implementation for the calculation of RI-MP2 energy gradients on multiple GPUs, which was integrated in the EXtreme-scale Electronic Structure System (EXESS) …”
Section: Introductionmentioning
confidence: 99%
“…A major paradigm shift in this hardware has occurred over the past decade with the widespread adoption of heterogeneous architectures. Accelerators such as graphical processing units (GPUs) now often provide the vast majority (>95%) of the computational power in the fastest machines. , Due to their fundamental architectural differences with CPUs, exploiting the computational capabilities of GPUs is challenging and requires a fundamental redesign of the complex algorithms underpinning quantum chemical methods …”
This article presents a novel algorithm for the calculation of analytic energy gradients from second-order Møller− Plesset perturbation theory within the Resolution-of-the-Identity approximation (RI-MP2), which is designed to achieve high performance on clusters with multiple graphical processing units (GPUs). The algorithm uses GPUs for all major steps of the calculation, including integral generation, formation of all required intermediate tensors, solution of the Z-vector equation and gradient accumulation. The implementation in the EXtreme Scale Electronic Structure System (EXESS) software package includes a tailored, highly efficient, multistream scheduling system to hide CPU-GPU data transfer latencies and allows nodes with 8 A100 GPUs to operate at over 80% of theoretical peak floating-point performance. Comparative performance analysis shows a significant reduction in computational time relative to traditional multicore CPU-based methods, with our approach achieving up to a 95-fold speedup over the single-node performance of established software such as Q-Chem and ORCA. Additionally, we demonstrate that pairing our implementation with the molecular fragmentation framework in EXESS can drastically lower the computational scaling of RI-MP2 gradient calculations from quintic to subquadratic, enabling further substantial savings in runtime while retaining high numerical accuracy in the resulting gradients.
“…The present work builds on the existing RI-MP2 energy algorithm in EXESS detailed in a previous publication . The MO coefficients and eigenvalues are taken from the HF stage and a similar approach is used to calculate the B ia P and J PQ intermediates.…”
Section: Algorithm and Implementationmentioning
confidence: 99%
“…The EXESS software utilizes distributed MPI parallelism with one MPI process per GPU along with an additional coordinating process for dynamic work distribution . The majority of the intermediate values are stored in a MPI shared memory window to avoid duplication across ranks and reduce communication costs …”
Section: Algorithm and Implementationmentioning
confidence: 99%
“…In this Article, we present a novel high-performance algorithm and software implementation for the calculation of RI-MP2 energy gradients on multiple GPUs, which was integrated in the EXtreme-scale Electronic Structure System (EXESS) …”
Section: Introductionmentioning
confidence: 99%
“…A major paradigm shift in this hardware has occurred over the past decade with the widespread adoption of heterogeneous architectures. Accelerators such as graphical processing units (GPUs) now often provide the vast majority (>95%) of the computational power in the fastest machines. , Due to their fundamental architectural differences with CPUs, exploiting the computational capabilities of GPUs is challenging and requires a fundamental redesign of the complex algorithms underpinning quantum chemical methods …”
This article presents a novel algorithm for the calculation of analytic energy gradients from second-order Møller− Plesset perturbation theory within the Resolution-of-the-Identity approximation (RI-MP2), which is designed to achieve high performance on clusters with multiple graphical processing units (GPUs). The algorithm uses GPUs for all major steps of the calculation, including integral generation, formation of all required intermediate tensors, solution of the Z-vector equation and gradient accumulation. The implementation in the EXtreme Scale Electronic Structure System (EXESS) software package includes a tailored, highly efficient, multistream scheduling system to hide CPU-GPU data transfer latencies and allows nodes with 8 A100 GPUs to operate at over 80% of theoretical peak floating-point performance. Comparative performance analysis shows a significant reduction in computational time relative to traditional multicore CPU-based methods, with our approach achieving up to a 95-fold speedup over the single-node performance of established software such as Q-Chem and ORCA. Additionally, we demonstrate that pairing our implementation with the molecular fragmentation framework in EXESS can drastically lower the computational scaling of RI-MP2 gradient calculations from quintic to subquadratic, enabling further substantial savings in runtime while retaining high numerical accuracy in the resulting gradients.
“…To address this important question we have developed and implemented high performance, GPUaccelerated versions of the MBE and FMO methods in the EXtreme-scale Electronic Structure System (EXESS) program. 27,[30][31][32][33][34] Additionally, we have developed a new method, the Coulomb Perturbed Fragmentation (CPF) which, instead of iterating the ESP to self-consistency, calculates it only once using the densities of the individual fragments converged in vacuo, thus avoiding the outer SCF. All n-body contributions to the CPF energy are computed in the presence of the fixed ESP of the surrounding fragments.…”
We present an accuracy analysis of several fragmenta- tion methods including the embedding-free Many Body Expansion (MBE), the electrostatically-embedded MBE (EE-MBE), the Fragment Molecular Orbital (FMO) and the presently introduced Coulomb Per- turbed Fragmentation (CPF) method. We show that the iterative correction to the electrostatic potential in FMO introduces little to no accuracy gains when com- pared to the non-iterative EE-MBE and CPF methods. Additionally, we present performance comparisons of GPU accelerated implementations of most of these methods and show that MBE4 is not only more ac- curate than FMO3 but, when sufficient parallelism is provided, it is also faster.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.