Comprehensive exploration of graphically defined reaction spaces

Zhao, Qiyuan; Vaddadi, Sai Mahit; Woulfe, Michael; Ogunfowora, Lawal Adewale; Garimella, Sanjay S; Isayev, Olexandr; Savoie, Brett M.

doi:10.1038/s41597-023-02043-z

Cited by 21 publications

(20 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The Chemprop and EGAT models used in this paper were trained on the RGD1 data set, which contains around 177,000 reactions with up to 10 heavy atoms consisting of carbon, hydrogen, nitrogen, and oxygen. In brief, the RGD1 data set was generated by a graph-based enumeration of ∼700,000 reactions involving reactants sampled from PubChem. , A reaction conformational sampling strategy , was applied to generate up to three conformations for each reaction that were used to initialize double-ended TS searches, followed by Berny optimization, and intrinsic reaction coordinate (IRC) validation at the GFN2-xTB level of theory.…”

Section: Methodsmentioning

confidence: 99%

Graph to Activation Energy Models Easily Reach Irreducible Errors but Show Limited Transferability

Vadaddi,

Zhao,

Savoie

2024

J. Phys. Chem. A

Self Cite

View full text Add to dashboard Cite

Activation energy characterization of competing reactions is a costly but crucial step for understanding the kinetic relevance of distinct reaction pathways, product yields, and myriad other properties of reacting systems. The standard methodology for activation energy characterization has historically been a transition state search using the highest level of theory that can be afforded. However, recently, several groups have popularized the idea of predicting activation energies directly based on nothing more than the reactant and product graphs, a sufficiently complex neural network, and a broad enough data set. Here, we have revisited this task using the recently developed Reaction Graph Depth 1 (RGD1) transition state data set and several newly developed graph attention architectures. All of these new architectures achieve similar state-of-the-art results of ∼4 kcal/mol mean absolute error on withheld testing sets of reactions but poor performance on external testing sets composed of reactions with differing mechanisms, reaction molecularity, or reactant size distribution. Limited transferability is also shown to be shared by other contemporary graph to activation energy architectures through a series of case studies. We conclude that an array of standard graph architectures can already achieve results comparable to the irreducible error of available reaction data sets but that out-of-distribution performance remains poor.

show abstract

Section: Methodsmentioning

confidence: 99%

Graph to Activation Energy Models Easily Reach Irreducible Errors but Show Limited Transferability

Vadaddi,

Zhao,

Savoie

2024

J. Phys. Chem. A

Self Cite

View full text Add to dashboard Cite

show abstract

“…The reaction graph depth (RGD1) dataset 33 is implemented to test the performance of chemical reaction prediction. It contains 176 992 organic reactions with validated transition states, activation energy, heat of reaction, reactant and product geometries, frequencies, and atom mapping.…”

Section: Methodsmentioning

confidence: 99%

Atomic fragment approximation from a tensor network

Lin,

Zhu

2023

Digital Discovery

View full text Add to dashboard Cite

show abstract

“…We would like to note that it would also be possible to describe this approach as tracking additional information about the columns and rows in the so-called bond and electron (BE) matrix . The counting of elements in such a BE matrix is similar to how some automated reaction exploration software packages restrict explorations. − …”

Section: Theoretical Backgroundmentioning

confidence: 99%

Accelerating Reaction Network Explorations with Automated Reaction Template Extraction and Application

Unsleber

2023

J. Chem. Inf. Model.

View full text Add to dashboard Cite

Autonomously exploring chemical reaction networks with first-principles methods can generate vast data. Especially autonomous explorations without tight constraints risk getting trapped in regions of reaction networks that are not of interest. In many cases, these regions of the networks are only exited once fully searched. Consequently, the required human time for analysis and computer time for data generation can make these investigations unfeasible. Here, we show how simple reaction templates can facilitate the transfer of chemical knowledge from expert input or existing data into new explorations. This process significantly accelerates reaction network explorations and improves cost-effectiveness. We discuss the definition of the reaction templates and their generation based on molecular graphs. The resulting simple filtering mechanism for autonomous reaction network investigations is exemplified with a polymerization reaction.

show abstract

Comprehensive exploration of graphically defined reaction spaces

Cited by 21 publications

References 40 publications

Graph to Activation Energy Models Easily Reach Irreducible Errors but Show Limited Transferability

Graph to Activation Energy Models Easily Reach Irreducible Errors but Show Limited Transferability

Atomic fragment approximation from a tensor network

Accelerating Reaction Network Explorations with Automated Reaction Template Extraction and Application

Contact Info

Product

Resources

About