“…Given a difference in energy between these states from an inexpensive reference method, ∆E ref , we train functionals to minimize the mean squared deviation between the corrected energy difference, ∆E ref + ∆E target , and a target energy difference, ∆E target (in this work, singlet-triplet energy splittings from MRCISD-F12+Q); this training scheme is outlined in Figure1. Although this centering of the loss function solely on relative energies stands in contrast to previous work in NeuralXC,21 DeepKS,22 OrbNet,23 and KDFA,29 it has three advantages: (i) it allows benchmark results to be obtained from a variety of different sources (including experiment, which almost always yields relative energies); (ii) relative energies are the quantities of most interest to chemists, since bond energies, energies of reaction, and barrier heights are all relative energies; and (iii) theoretical data used for training is almost always more accurate for relative energies than for absolute energies. For optimization of parameters and hyperparameters, the 360 carbenes were split into a training set of 289 carbenes, a validation set of 37 carbenes, and a test set of 36 carbenes.…”