Transfer Learning to CCSD(T): Accurate Anharmonic Frequencies from Machine Learning Models

Käser, Silvan; Boittier, Eric; Upadhyay, Meenu; Meuwly, Markus

doi:10.1021/acs.jctc.1c00249

Cited by 28 publications

(34 citation statements)

References 72 publications

(147 reference statements)

“… 50 Interestingly, this reassignment was recently supported from second order vibrational perturbation theory (VPT2) calculations using a neural network-(NN) based PES. 23 …”

Section: Introduction (mentioning)

confidence: 99%

Transfer learned potential energy surfaces: accurate anharmonic vibrational dynamics and dissociation energies for the formic acid monomer and dimer

Käser

Meuwly

2022

Phys. Chem. Chem. Phys.

Self Cite


“…To set the stage, the tunneling splittings for malonaldehyde were calculated on the Phys-Net MP2 PES using RPI theory. The tunneling splitting calculations were carried out with three different values of the imaginary time, τ, corresponding to effective 'temperatures' 25,12.5] K and with different numbers of beads N ∈ [2^5, ..., 2^12] to ensure convergence. Formally the instanton result is defined in the low-temperature limit, which is equivalent to infinitely-long imaginary times.…”

Section: Results (mentioning)

confidence: 99%

“…To avoid the need for calculating large ab initio data sets at high levels of theory transfer learning (TL) [18][19][20] and related ∆-ML 21 were shown to be data and cost-effective alternatives. [22][23][24][25][26][27] The combination of TL and instanton theory appears particularly appealing as the instanton path (IP) can be determined on a low-level PES, which gives a rough approximation to the true tunneling path, and can be included (and iteratively refined if needed) into the TL data set. Additionally, the IP is inherently local and, thus, allows concentrating on improving only a small part of a PES.…”

Section: Introduction (mentioning)

confidence: 99%

Transfer learning for affordable and high quality tunneling splittings from instanton calculations

Käser

Richardson

Meuwly

2022

Preprint

Self Cite

The combination of transfer learning (TL) of a low-level potential energy surface (PES) to a higher level of electronic structure theory together with ring-polymer instanton (RPI) theory is explored and applied to malonaldehyde. The RPI approach provides a semiclassical approximation of the tunneling splitting and depends sensitively on the accuracy of the PES. With second-order Møller-Plesset perturbation theory (MP2) as the low-level (LL) model and energies and forces from coupled cluster singles, doubles and perturbative triples (CCSD(T)) as the high-level (HL) model, it is demonstrated that CCSD(T) information from only 25 to 50 judiciously selected structures along and around the instanton path suffices to reach HL accuracy for the tunneling splitting. In addition, the global quality of the HL-PES is demonstrated through a mean average error of 0.3 kcal/mol for energies up to 40 kcal/mol above the minimum energy structure (a factor of 2 higher than the energies employed during TL) and < 2 cm⁻¹ for harmonic frequencies compared with computationally challenging normal mode calculations at the CCSD(T) level.

“…It is interesting to note that normal mode sampling was also found to be insufficient for generating sufficiently reliable, full-dimensional NN-based near-equilibrium potential energy surfaces for harmonic and anharmonic normal modes. 82 Another determinant property is the number of heavy atoms in molecules covered in the database (Section 4.1). Not surprisingly, better results are obtained for the range covered by the database, and if a sufficient number of samples is available, e.g., when considering the performance depending on the number of heavy atoms in SetLE9.…”

Section: Discussion (mentioning)

confidence: 99%

“…However, using a complementary technique such as active learning can substantially improve the results, as was found for ANI-1x. It is interesting to note that normal mode sampling was also found to be insufficient for generating sufficiently reliable, full-dimensional NN-based near-equilibrium potential energy surfaces for harmonic and anharmonic normal modes …”

Section: Discussion (mentioning)

confidence: 99%

Impact of the Characteristics of Quantum Chemical Databases on Machine Learning Prediction of Tautomerization Energies

Vazquez-Salazar

Boittier

Unke

et al. 2021

J. Chem. Theory Comput.

Self Cite

An essential aspect for adequate predictions of chemical properties by machine learning models is the database used for training them. However, studies that analyze how the content and structure of the databases used for training impact the prediction quality are scarce. In this work, we analyze and quantify the relationships learned by a machine learning model (Neural Network) trained on five different reference databases (QM9, PC9, ANI-1E, ANI-1, and ANI-1x) to predict tautomerization energies from molecules in Tautobase. For this, the influence of characteristics such as the number of heavy atoms in a molecule, number of atoms of a given element, bond composition, or initial geometry on the quality of the predictions is considered. The results indicate that training on a chemically diverse database is crucial for obtaining good results and also that conformational sampling can partly compensate for limited coverage of chemical diversity. The overall best-performing reference database (ANI-1x) performs on average 1 kcal/mol better than PC9, which, however, contains about 2 orders of magnitude fewer reference structures. On the other hand, PC9 is chemically more diverse by a factor of ∼5 as quantified by the number of atom-in-molecule-based fragments (amons) it contains compared with the ANI family of databases. A quantitative measure for deficiencies is the Kullback–Leibler divergence between reference and target distributions. It is explicitly demonstrated that when certain types of bonds need to be covered in the target database (Tautobase) but are undersampled in the reference databases, the resulting predictions are poor. Examples of this include the poor performance of all databases analyzed to predict C(sp2)–C(sp2) double bonds close to heteroatoms and azoles containing N–N and N–O bonds. Analysis of the results with a Tree MAP algorithm provides deeper understanding of specific deficiencies in predicting tautomerization energies by the reference datasets due to inadequate coverage of chemical space. This information can be used to either improve existing databases or generate new databases of sufficient diversity for a range of machine learning (ML) applications in chemistry.
