2019
DOI: 10.1080/08927022.2019.1603380

DL_POLY - A performance overview analysing, understanding and exploiting available HPC technology

Abstract: This paper considers the performance attributes of the molecular simulation code, DL_POLY, as measured and analysed over the past two decades. Following a brief overview of HPC technology, and the performance improvements over that period, we define the benchmark cases (for both DL_POLY Classic and DL_POLY 3 & 4) used in generating a broad overview of performance across well over 100 HPC systems, from the Cray T3E/1200 to today's Intel Skylake clusters and accelerator technology offerings from Intel's Xeon Phi co-…
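A rough sketch of the kind of cross-system comparison the abstract describes: normalising wall-clock times for a fixed benchmark case against an older baseline machine. The system names are taken from the abstract, but every timing below is invented purely for illustration and none of them comes from the paper.

    # Hypothetical wall-clock times (seconds) for one fixed DL_POLY benchmark case
    # on different systems; illustrative values only, not data from the paper.
    benchmark_times = {
        "Cray T3E/1200 (baseline)": 5400.0,
        "Commodity cluster, ca. 2010": 310.0,
        "Intel Skylake cluster": 42.0,
    }

    baseline = benchmark_times["Cray T3E/1200 (baseline)"]

    for system, t in benchmark_times.items():
        speedup = baseline / t  # relative performance versus the baseline system
        print(f"{system:30s}  time = {t:8.1f} s   relative performance = {speedup:6.1f}x")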

Cited by 10 publications (6 citation statements) | References 35 publications (38 reference statements)

Citation statements:
“…The reason that DL_FFLUX scales worse than DL_POLY in the L′ = 0 case is once again due to the redundant work when predicting point charges. Note that the scalability of the MPI in various versions of DL_POLY has been tested extensively in the past. …”
Section: Results (mentioning, confidence: 99%)
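Scaling statements like the one above are usually quantified as strong-scaling speedup and parallel efficiency. The sketch below shows that standard calculation on hypothetical wall-clock times; the rank counts and timings are assumptions for illustration, not measurements from DL_FFLUX or DL_POLY.

    # Hypothetical wall times (seconds) for a fixed problem size at increasing MPI rank counts.
    wall_times = {1: 1000.0, 8: 140.0, 36: 38.0}   # illustrative values only

    t1 = wall_times[1]
    for nprocs, t in sorted(wall_times.items()):
        speedup = t1 / t
        efficiency = speedup / nprocs        # 1.0 would be ideal strong scaling
        print(f"ranks={nprocs:3d}  speedup={speedup:6.2f}  efficiency={efficiency:5.2f}")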
“…Note that the scalability of the MPI in various versions of DL_POLY has been tested extensively in the past. 56,57 Profiling also gives us some insight into the percentage of the total runtime spent in DL_FFLUX and DL_POLY routines (as well as MPI-specific routines), which is presented in Figure 16 as pie charts. For the sake of clarity, just the profiles for N p = 1, 8, and 36 are shown; the rest of the pie charts appear in Figure S9 as well as a more detailed "sample" breakdown in Figure S10 showing the relative timings of the top 5 most costly subroutines.…”
Section: Results (mentioning, confidence: 99%)
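A minimal sketch of how a per-routine profile is reduced to the runtime percentages shown in such pie charts. The routine names and timings below are invented for illustration and are not the profile reported in that work.

    # Invented per-routine timings (seconds) from a single profiling run.
    profile = {
        "force_evaluation": 620.0,
        "charge_prediction": 210.0,
        "MPI_Allreduce": 95.0,
        "other": 75.0,
    }

    total = sum(profile.values())
    # Report each routine's share of the total runtime, largest first.
    for routine, t in sorted(profile.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{routine:18s} {100.0 * t / total:5.1f} % of runtime")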
“…The PIM approach has not been widely applied to the simulation of oxide glasses because of its complexity, which makes it difficult to parameterize, and because it has not been extended to many elements, so it lacks transferability. Furthermore, most of the widely used MD programs (DL_POLY [150], LAMMPS [151], …) do not support this model. To our knowledge, its applications have been limited to simulations of SiO2 [58,152], GeO2 [153], B2O3 [154] and Na-borosilicate [149] amorphous systems, yielding good vibrational spectra and stabilising the formation of boroxol rings in B2O3, producing ~33% of rings, better than other force fields.…”
Section: The Polarizable Interatomic Models (mentioning, confidence: 99%)
“…However, improvements in code performance will of course depend on the end-user application, e.g. a classical molecular dynamics code may be compute-bound [7] whilst CFD codes are typically memory-bound [8]. This calls for heterogeneous HPC systems that comprise a variety of chip architectures, allowing users to maximise the performance of their specific computing applications (Gray et al. [9]).…”
Section: Introduction (mentioning, confidence: 99%)
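The compute-bound versus memory-bound distinction drawn above is often made with a roofline-style estimate: compare a kernel's arithmetic intensity (FLOPs per byte moved) with the machine balance. The hardware and kernel figures below are invented solely to illustrate the test, not measurements of any particular code or machine.

    # Illustrative hardware figures (not any specific machine).
    peak_flops = 3.0e12        # peak double-precision FLOP/s
    peak_bandwidth = 2.0e11    # peak memory bandwidth, bytes/s
    machine_balance = peak_flops / peak_bandwidth   # FLOPs per byte at the roofline ridge

    def classify(flops, bytes_moved):
        """Classify a kernel as compute- or memory-bound from its arithmetic intensity."""
        intensity = flops / bytes_moved
        return "compute-bound" if intensity > machine_balance else "memory-bound"

    # Invented kernel figures: a short-range force loop vs. a stencil update.
    print("MD force kernel:   ", classify(flops=8.0e9, bytes_moved=2.0e8))   # high intensity
    print("CFD stencil kernel:", classify(flops=1.0e9, bytes_moved=4.0e9))   # low intensity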