Ferenc Karsai scite author profile

Realistic finite-temperature simulations of matter are a formidable challenge for first-principles methods. Long simulation times and large length scales are required, demanding years of compute time. Here we present an on-the-fly machine learning scheme that generates force fields automatically during molecular dynamics simulations. This opens up the required time and length scales, while retaining the distinctive chemical precision of first-principles methods and minimizing the need for human intervention. The method is widely applicable to multi-element complex systems. We demonstrate its predictive power on the entropy-driven phase transitions of hybrid perovskites, which have never been accurately described in simulations. Using machine-learned potentials, isothermal-isobaric simulations give direct insight into the underlying microscopic mechanisms. Finally, we relate the phase transition temperatures of different perovskites to the radii of the involved species, and we determine the order of the transitions in Landau theory.

On-the-fly machine learning force field generation: Application to melting points

Jinnouchi

2019

An efficient and robust on-the-fly machine learning force field method is developed and integrated into an electronic-structure code. This method automatically generates machine learning force fields on the basis of Bayesian inference during molecular dynamics simulations, where first-principles calculations are executed only when new configurations appear that lie outside the already sampled datasets. The developed method is applied to the calculation of melting points of Al, Si, Ge, Sn and MgO. The applications indicate that more than 99% of the first-principles calculations are bypassed during the force field generation. This allows the machine to quickly construct first-principles datasets over wide phase spaces. Furthermore, with the help of the generated machine learning force fields, simulations are accelerated by a factor of a thousand compared with first-principles calculations. The accuracy of the melting points calculated by the force fields is examined by thermodynamic perturbation theory, and the examination indicates that the machine learning force fields can quantitatively reproduce the first-principles melting points.
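
The query logic described in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: a one-dimensional toy system, a hypothetical `reference_force` standing in for the first-principles call, Gaussian basis functions standing in for descriptors, and Bayesian linear regression whose predictive uncertainty decides when the reference calculation is needed.

```python
import math

# All names and parameters here are hypothetical, chosen for a 1D toy problem.
CENTERS = [-2.0 + 0.5 * i for i in range(9)]  # centres of Gaussian basis functions
WIDTH, ALPHA, BETA = 0.5, 1.0, 1.0e4          # basis width, prior precision, noise precision

def reference_force(x):
    """Stand-in for an expensive first-principles force (toy potential U = -cos x)."""
    return -math.sin(x)

def features(x):
    return [math.exp(-((x - c) ** 2) / (2.0 * WIDTH ** 2)) for c in CENTERS]

def solve(a, b):
    """Solve a @ z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= f * m[col][k]
    z = [0.0] * n
    for i in reversed(range(n)):
        z[i] = (m[i][n] - sum(m[i][k] * z[k] for k in range(i + 1, n))) / m[i][i]
    return z

class BayesianForceField:
    """Bayesian linear regression: precision A = ALPHA*I + BETA*Phi^T Phi."""
    def __init__(self):
        n = len(CENTERS)
        self.a = [[ALPHA if i == j else 0.0 for j in range(n)] for i in range(n)]
        self.b = [0.0] * n  # accumulates BETA * Phi^T y

    def add(self, x, force):
        phi = features(x)
        for i, pi in enumerate(phi):
            self.b[i] += BETA * pi * force
            for j, pj in enumerate(phi):
                self.a[i][j] += BETA * pi * pj

    def predict(self, x):
        phi = features(x)
        w = solve(self.a, self.b)                      # posterior mean weights
        mean = sum(p * wi for p, wi in zip(phi, w))
        var = sum(p * z for p, z in zip(phi, solve(self.a, phi)))  # phi^T A^-1 phi
        return mean, math.sqrt(max(var, 0.0))

def run_md(steps=400, dt=0.05, threshold=0.1):
    """Velocity-Verlet dynamics (unit mass) with on-the-fly training."""
    model, x, v, calls = BayesianForceField(), 1.5, 0.0, 0

    def force(pos):
        nonlocal calls
        mean, sigma = model.predict(pos)
        if sigma > threshold:          # model is uncertain: query the reference
            f = reference_force(pos)
            model.add(pos, f)
            calls += 1
            return f
        return mean                    # model is trusted: skip the reference

    f = force(x)
    for _ in range(steps):
        v += 0.5 * dt * f
        x += dt * v
        f = force(x)
        v += 0.5 * dt * f
    return calls, steps + 1            # reference calls, total force evaluations

calls, total = run_md()
print(f"reference calls: {calls} of {total} force evaluations")
```

In this toy, the reference is queried only while the predictive uncertainty exceeds the chosen threshold, so most force evaluations fall back on the regression model once the visited region has been sampled. The threshold value and the basis set are arbitrary choices and would need tuning in any real application.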

Electron–phonon coupling in semiconductors within the GW approximation

et al. 2018

The magnitude of the renormalization of the band gaps due to zero-point motions of the lattice is calculated for 18 semiconductors, including diamond and silicon. This particular collection of semiconductors spans a wide range of band gaps and atomic masses. The renormalized electronic structures are obtained using stochastic methods to sample the displacements related to the vibrations in the lattice. Specifically, a recently developed one-shot method is utilized where only a single calculation is needed to obtain results similar to those of standard Monte Carlo sampling. In addition, a fast real-space GW method is employed and the effects of G0W0 corrections on the renormalization are also investigated. We find that the band-gap renormalizations depend inversely on the mass of the constituent ions, and that for the majority of investigated compounds the G0W0 corrections to the renormalization are very small and thus not significant.

Descriptors representing two- and three-body atomic distributions and their effects on the accuracy of machine-learned inter-atomic potentials

et al. 2020

When determining machine-learning models for inter-atomic potentials, the potential energy surface is often described as a non-linear function of descriptors representing two- and three-body atomic distribution functions. It is not obvious how the choice of the descriptors affects the efficiency of the training and the accuracy of the final machine-learned model. In this work, we formulate an efficient method to calculate descriptors that can separately represent two- and three-body atomic distribution functions, and we examine the effects of including only two- or three-body descriptors, as well as including both, in the regression model. Our study indicates that non-linear mixing of two- and three-body descriptors is essential for efficient training and high accuracy of the final machine-learned model. The efficiency can be further improved by weighting the two-body descriptors more strongly. We furthermore examine a sparsification of the three-body descriptors. The three-body descriptors usually provide redundant representations of the atomistic structure, and the number of descriptors can be significantly reduced without loss of accuracy by applying an automatic sparsification using a principal component analysis. Visualization of the reduced descriptors using three-body distribution functions in real space indicates that the sparsification automatically removes the components that are less significant for describing the distribution function.
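
The idea that redundant descriptors can be detected with a principal component analysis can be sketched on synthetic data. The example below is only an illustration under stated assumptions, not the procedure of the paper: six hypothetical descriptors are built as linear mixtures of two underlying signals (the `MIX` coefficients are invented), and plain PCA via a Jacobi eigenvalue solver counts how many components carry non-negligible variance.

```python
import math

def covariance(rows):
    """Sample covariance matrix of the descriptor columns."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[sum((r[i] - mean[i]) * (r[j] - mean[j]) for r in rows) / (n - 1)
             for j in range(d)] for i in range(d)]

def jacobi_eigenvalues(sym, sweeps=50):
    """Eigenvalues of a symmetric matrix by cyclic Jacobi rotations, largest first."""
    n = len(sym)
    a = [row[:] for row in sym]
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < 1e-30:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                diff = a[q][q] - a[p][p]
                # rotation angle that zeroes the (p, q) element
                theta = math.pi / 4 if diff == 0.0 else 0.5 * math.atan(2.0 * a[p][q] / diff)
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # rotate columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(n):  # rotate rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
    return sorted((a[i][i] for i in range(n)), reverse=True)

def count_significant(eigs, rel_tol=1e-10):
    """Number of principal components above a relative variance cutoff."""
    return sum(1 for e in eigs if e > rel_tol * eigs[0])

# Synthetic, deliberately redundant descriptors: 6 columns mixed from 2 signals.
MIX = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, -1.0), (2.0, 1.0), (0.5, -2.0)]
rows = [[a * math.sin(0.3 * i) + b * math.cos(0.7 * i) for a, b in MIX]
        for i in range(50)]

eigs = jacobi_eigenvalues(covariance(rows))
kept = count_significant(eigs)
print(f"{kept} of {len(eigs)} principal components carry the variance")
```

Because the six columns are exact linear combinations of two signals, only two principal components have variance above the cutoff. Real descriptor sets are not exactly low-rank, so the cutoff `rel_tol` (an arbitrary value here) would control the trade-off between compression and accuracy.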

On-the-Fly Active Learning of Interatomic Potentials for Large-Scale Atomistic Simulations

Jinnouchi, Miwa, Karsai et al. 2020, J. Phys. Chem. Lett.

123

The on-the-fly generation of machine-learning force fields by active-learning schemes attracts a great deal of attention in the community of atomistic simulations. The algorithms allow the machine to self-learn an interatomic potential and construct machine-learned models on the fly during simulations. State-of-the-art query strategies allow the machine to judge whether or not new structures lie outside the training data set. First-principles calculations are carried out only when the machine judges it necessary to update the data set with the new structures. Otherwise, the currently available machine-learned model is used to update the atomic positions. In this manner, most of the first-principles calculations are bypassed during training, and overall, simulations are accelerated by several orders of magnitude while retaining almost first-principles accuracy. In this Perspective, after describing essential components of the active-learning algorithms, we demonstrate the power of the schemes by presenting recent applications.

Ferenc Karsai

Phase Transitions of Hybrid Perovskites Simulated by Machine-Learning Force Fields Trained on the Fly with Bayesian Inference
