David M. Wilkins scite author profile

We provide an introduction to Gaussian process regression (GPR) machine-learning methods in computational materials science and chemistry. The focus of the present review is on the regression of atomistic properties: in particular, on the construction of interatomic potentials, or force fields, in the Gaussian Approximation Potential (GAP) framework; beyond this, we also discuss the fitting of arbitrary scalar, vectorial, and tensorial quantities. Methodological aspects of reference data generation, representation, and regression, as well as the question of how a data-driven model may be validated, are reviewed and critically discussed. A survey of applications to a variety of research questions in chemistry and materials science illustrates the rapid growth in the field. A vision is outlined for the development of the methodology in the years to come.

show abstract

Symmetry-Adapted Machine Learning for Tensorial Properties of Atomistic Systems

Grisafi

et al. 2018

View full text Add to dashboard Cite

Statistical learning methods show great promise in providing an accurate prediction of materials and molecular properties, while minimizing the need for computationally demanding electronic structure calculations. The accuracy and transferability of these models are increased significantly by encoding into the learning procedure the fundamental symmetries of rotational and permutational invariance of scalar properties. However, the prediction of tensorial properties requires that the model respects the appropriate geometric transformations, rather than invariance, when the reference frame is rotated. We introduce a formalism that extends existing schemes and makes it possible to perform machine learning of tensorial properties of arbitrary rank, and for general molecular geometries. To demonstrate it, we derive a tensor kernel adapted to rotational symmetry, which is the natural generalization of the smooth overlap of atomic positions kernel commonly used for the prediction of scalar properties at the atomic scale. The performance and generality of the approach is demonstrated by learning the instantaneous response to an external electric field of water oligomers of increasing complexity, from the isolated molecule to the condensed phase.

show abstract

Electrolytes induce long-range orientational order and free energy changes in the H-bond network of bulk water

et al. 2016

View full text Add to dashboard Cite

show abstract

Transferable Machine-Learning Model of the Electron Density

et al. 2018

View full text Add to dashboard Cite

The electronic charge density plays a central role in determining the behavior of matter at the atomic scale, but its computational evaluation requires demanding electronic-structure calculations. We introduce an atom-centered, symmetry-adapted framework to machine-learn the valence charge density based on a small number of reference calculations. The model is highly transferable, meaning it can be trained on electronic-structure data of small molecules and used to predict the charge density of larger compounds with low, linear-scaling cost. Applications are shown for various hydrocarbon molecules of increasing complexity and flexibility, and demonstrate the accuracy of the model when predicting the density on octane and octatetraene after training exclusively on butane and butadiene. This transferable, data-driven model can be used to interpret experiments, accelerate electronic structure calculations, and compute electrostatic interactions in molecules and condensed-phase systems.

show abstract

i-PI 2.0: A universal force engine for advanced molecular simulations

Kapil

Rossi

Maršálek

et al. 2019

Computer Physics Communications

285

229

View full text Add to dashboard Cite

Progress in the atomic-scale modelling of matter over the past decade has been tremendous. This progress has been brought about by improvements in methods for evaluating interatomic forces that work by either solving the electronic structure problem explicitly, or by computing accurate approximations of the solution and by the development of techniques that use the Born-Oppenheimer (BO) forces to move the atoms on the BO potential energy surface. As a consequence of these developments it is now possible to identify stable or metastable states, to sample configurations consistent with the appropriate thermodynamic ensemble, and to estimate the kinetics of reactions and phase transitions. All too often, however, progress is slowed down by the bottleneck associated with implementing new optimization algorithms and/or sampling techniques into the many existing electronic-structure and empirical-potential codes. To address this problem, we are thus releasing a new version of the i-PI software. This piece of software is an easily extensible framework for implementing advanced atomistic simulation techniques using interatomic potentials and forces calculated by an external driver code. While the original version of the code[1] was developed with a focus on path integral molecular dynamics techniques, this second release of i-PI not only includes several new advanced path integral methods, but also offers other classes of algorithms. In other words, i-PI is moving towards becoming a universal force engine that is both modular and tightly coupled to the driver codes that evaluate the potential energy surface and its derivatives.

show abstract

Accurate molecular polarizabilities with coupled cluster theory and machine learning

Wilkins

Grisafi

Yang

et al. 2019

Proc. Natl. Acad. Sci. U.S.A.

167

202

View full text Add to dashboard Cite

The molecular polarizability describes the tendency of a molecule to deform or polarize in response to an applied electric field. As such, this quantity governs key intra-and inter-molecular interactions such as induction and dispersion, plays a key role in determining the spectroscopic signatures of molecules, and is an essential ingredient in polarizable force fields and other empirical models for collective interactions. Compared to other ground-state properties, an accurate and reliable prediction of the molecular polarizability is considerably more difficult as this response quantity is quite sensitive to the description of the underlying molecular electronic structure. In this work, we present state-of-the-art quantum mechanical calculations of the static dipole polarizability tensors of 7,211 small organic molecules computed using linear-response coupled-cluster singles and doubles theory (LR-CCSD). Using a symmetry-adapted machine-learning based approach, we demonstrate that it is possible to predict the molecular polarizability with LR-CCSD accuracy at a negligible computational cost. The employed model is quite robust and transferable, yielding molecular polarizabilities for a diverse set of 52 larger molecules (which includes challenging conjugated systems, carbohydrates, small drugs, amino acids, nucleobases, and hydrocarbon isomers) at an accuracy that exceeds that of hybrid density functional theory (DFT). The atom-centered decomposition implicit in our machine-learning approach offers some insight into the shortcomings of DFT in the prediction of this fundamental quantity of interest. arXiv:1809.05337v1 [physics.chem-ph]

show abstract

Predicting molecular dipole moments by combining atomic partial charges and atomic dipoles

et al. 2020

View full text Add to dashboard Cite

The molecular dipole moment (µ) is a central quantity in chemistry. It is essential in predicting infrared and sum-frequency generation spectra, as well as induction and long-range electrostatic interactions. Furthermore, it can be extracted directly-via the ground state electron density-from high-level quantum mechanical calculations, making it an ideal target for machine learning (ML). In this work, we choose to represent this quantity with a physically inspired ML model that captures two distinct physical effects: local atomic polarization is captured within the symmetry-adapted Gaussian process regression (SA-GPR) framework, which assigns a (vector) dipole moment to each atom, while movement of charge across the entire molecule is captured by assigning a partial (scalar) charge to each atom. The resulting "MuML" models are fitted together to reproduce molecular µ computed using high-level coupled-cluster theory (CCSD) and density functional theory (DFT) on the QM7b dataset, achieving more accurate results due to the physics-based combination of these complementary terms. The combined model shows excellent transferability when applied to a showcase dataset of larger and more complex molecules, approaching the accuracy of DFT at a small fraction of the computational cost. We also demonstrate that the uncertainty in the predictions can be estimated reliably using a calibrated committee model. The ultimate performance of the models-and the optimal weighting of their combination-depend, however, on the details of the system at hand, with the scalar model being clearly superior when describing large molecules whose dipole is almost entirely generated by charge separation. These observations point to the importance of simultaneously accounting for the local and non-local effects that contribute to µ; further, they define a challenging task to benchmark future models, particularly those aimed at the description of condensed phases.

show abstract

Why Quantum Coherence Is Not Important in the Fenna–Matthews–Olsen Complex

Wilkins

Dattani

2015

J. Chem. Theory Comput.

103

View full text Add to dashboard Cite

We develop and present an improvement to the conventional technique for solving the Hierarchical Equations of Motion which reduces the memory cost by more than 75% while retaining the same convergence rate and accuracy. This allows for a full calculation of the population dynamics of the 24-site FMO trimer for long timescales with very little effort, and we present the first fully converged, exact results for the 7-site subsystem of the monomer, and for the full 24-site trimer.Owing to this new approach, our numerically exact 24-site, 2-exponential results are the most demanding HEOM calculations performed to date, surpassing the 50-site, 1-exponential results of Strumpfer and Schulten [2012, J. Chem. Thy. & Comp., 8, 2808]. We then show where our exact 7-site results deviate from the approximation of Ishizaki and Fleming [2009, Proc. Natl. Acad. Sci. USA, 106, 17255]. Our exact results are then compared to calculations using the incoherent Förster theory, and it is found that the energy transfer from the antenna to the reaction centre occurs more than 50 times faster than the fluorescence lifetime of the excitation, whether or not coherence is considered. This means that coherence is not likely to improve the efficiency of the photosynthesis.In fact, the incoherent theory often tends to over-predict the rates of energy transfer, suggesting that in some cases electronic coherence may actually slow down the photosynthetic process.

show abstract

12 3 4 5 6

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

David M. Wilkins

Gaussian Process Regression for Materials and Molecules

Symmetry-Adapted Machine Learning for Tensorial Properties of Atomistic Systems

Electrolytes induce long-range orientational order and free energy changes in the H-bond network of bulk water

Transferable Machine-Learning Model of the Electron Density

i-PI 2.0: A universal force engine for advanced molecular simulations

Accurate molecular polarizabilities with coupled cluster theory and machine learning

Predicting molecular dipole moments by combining atomic partial charges and atomic dipoles

Why Quantum Coherence Is Not Important in the Fenna–Matthews–Olsen Complex

Contact Info

Product

Resources

About