Realistic finite temperature simulations of matter are a formidable challenge for first principles methods. Long simulation times and large length scales are required, demanding years of compute time. Here we present an on-the-fly machine learning scheme that generates force fields automatically during molecular dynamics simulations. This opens up the required time and length scales, while retaining the distinctive chemical precision of first principles methods and minimizing the need for human intervention. The method is widely applicable to multi-element complex systems. We demonstrate its predictive power on the entropy driven phase transitions of hybrid perovskites, which have never been accurately described in simulations. Using machine learned potentials, isothermalisobaric simulations give direct insight into the underlying microscopic mechanisms. Finally, we relate the phase transition temperatures of different perovskites to the radii of the involved species, and we determine the order of the transitions in Landau theory.
An efficient and robust on-the-fly machine learning force field method is developed and integrated into an electronic-structure code. This method realizes automatic generation of machine learning force fields on the basis of Bayesian inference during molecular dynamics simulations, where the first principles calculations are only executed, when new configurations out of already sampled datasets appear. The developed method is applied to the calculation of melting points of Al, Si, Ge, Sn and MgO. The applications indicate that more than 99 % of the first principles calculations are bypassed during the force field generation. This allows the machine to quickly construct first principles datasets over wide phase spaces. Furthermore, with the help of the generated machine learning force fields, simulations are accelerated by a factor of thousand compared with first principles calculations. Accuracies of the melting points calculated by the force fields are examined by thermodynamic perturbation theory, and the examination indicates that the machine learning force fields can quantitatively reproduce the first principles melting points.
The magnitude of the renormalization of the band gaps due to zero-point motions of the lattice is calculated for 18 semiconductors, including diamond and silicon. This particular collection of semiconductors constitute a wide range of band gaps and atomic masses. The renormalized electronic structures are obtained using stochastic methods to sample the displacement related to the vibrations in the lattice. Specifically, a recently developed one-shot method is utilized where only a single calculation is needed to get similar results as the one obtained by standard Monte-Carlo sampling. In addition, a fast real-space GW method is employed and the effects of G 0 W 0 corrections on the renormalization are also investigated. We find that the band-gap renormalizations inversely depend on the mass of the constituting ions, and that for the majority of investigated compounds the G 0 W 0 corrections to the renormalization are very small and thus not significant.
When determining machine-learning models for inter-atomic potentials, the potential energy surface is often described as a non-linear function of descriptors representing two- and three-body atomic distribution functions. It is not obvious how the choice of the descriptors affects the efficiency of the training and the accuracy of the final machine-learned model. In this work, we formulate an efficient method to calculate descriptors that can separately represent two- and three-body atomic distribution functions, and we examine the effects of including only two- or three-body descriptors, as well as including both, in the regression model. Our study indicates that non-linear mixing of two- and three-body descriptors is essential for an efficient training and a high accuracy of the final machine-learned model. The efficiency can be further improved by weighting the two-body descriptors more strongly. We furthermore examine a sparsification of the three-body descriptors. The three-body descriptors usually provide redundant representations of the atomistic structure, and the number of descriptors can be significantly reduced without loss of accuracy by applying an automatic sparsification using a principal component analysis. Visualization of the reduced descriptors using three-body distribution functions in real-space indicates that the sparsification automatically removes the components that are less significant for describing the distribution function.
The on-the-fly generation of machine-learning force fields by active-learning schemes attracts a great deal of attention in the community of atomistic simulations. The algorithms allow the machine to self-learn an interatomic potential and construct machine-learned models on the fly during simulations. State-of-the-art query strategies allow the machine to judge whether new structures are out of the training data set or not. Only when the machine judges the necessity of updating the data set with the new structures are first-principles calculations carried out. Otherwise, the yet available machine-learned model is used to update the atomic positions. In this manner, most of the first-principles calculations are bypassed during training, and overall, simulations are accelerated by several orders of magnitude while retaining almost first-principles accuracy. In this Perspective, after describing essential components of the active-learning algorithms, we demonstrate the power of the schemes by presenting recent applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.