2023
DOI: 10.1038/s41597-023-02043-z
|View full text |Cite
|
Sign up to set email alerts
|

Comprehensive exploration of graphically defined reaction spaces

Abstract: Existing reaction transition state (TS) databases are comparatively small and lack chemical diversity. Here, this data gap has been addressed using the concept of a graphically-defined model reaction to comprehensively characterize a reaction space associated with C, H, O, and N containing molecules with up to 10 heavy (non-hydrogen) atoms. The resulting dataset is composed of 176,992 organic reactions possessing at least one validated TS, activation energy, heat of reaction, reactant and product geometries, f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 21 publications
(20 citation statements)
references
References 40 publications
0
18
0
Order By: Relevance
“…The Chemprop and EGAT models used in this paper were trained on the RGD1 data set, which contains around 177,000 reactions with up to 10 heavy atoms consisting of carbon, hydrogen, nitrogen, and oxygen. In brief, the RGD1 data set was generated by a graph-based enumeration of ∼700,000 reactions involving reactants sampled from PubChem. , A reaction conformational sampling strategy , was applied to generate up to three conformations for each reaction that were used to initialize double-ended TS searches, followed by Berny optimization, and intrinsic reaction coordinate (IRC) validation at the GFN2-xTB level of theory.…”
Section: Methodsmentioning
confidence: 99%
“…The Chemprop and EGAT models used in this paper were trained on the RGD1 data set, which contains around 177,000 reactions with up to 10 heavy atoms consisting of carbon, hydrogen, nitrogen, and oxygen. In brief, the RGD1 data set was generated by a graph-based enumeration of ∼700,000 reactions involving reactants sampled from PubChem. , A reaction conformational sampling strategy , was applied to generate up to three conformations for each reaction that were used to initialize double-ended TS searches, followed by Berny optimization, and intrinsic reaction coordinate (IRC) validation at the GFN2-xTB level of theory.…”
Section: Methodsmentioning
confidence: 99%
“…The reaction graph depth (RGD1) dataset 33 is implemented to test the performance of chemical reaction prediction. It contains 176 992 organic reactions with validated transition states, activation energy, heat of reaction, reactant and product geometries, frequencies, and atom mapping.…”
Section: Methodsmentioning
confidence: 99%
“…We would like to note that it would also be possible to describe this approach as tracking additional information about the columns and rows in the so-called bond and electron (BE) matrix . The counting of elements in such a BE matrix is similar to how some automated reaction exploration software packages restrict explorations. …”
Section: Theoretical Backgroundmentioning
confidence: 99%