2024
DOI: 10.26434/chemrxiv-2023-nfq7h-v2
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Simple User-Friendly Reaction Format

David F. Nippa,
Alex T. Müller,
Kenneth Atz
et al.

Abstract: Leveraging the increasing volume of chemical reaction data can enhance synthesis planning and improve suc- cess rates. However, machine learning applications for retrosynthesis planning and forward reaction prediction tools depend on having readily available, high-quality data in a structured format. While some public and licensed reaction databases are available, they frequently lack essential information about reaction condi- tions. To address this issue and promote the principles of findable, accessible, in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 38 publications
0
0
0
Order By: Relevance
“…Apart from the five mentioned extraction tasks, it can be easily extended to tasks related to extracting information from scientific literature and transforming data into a simple user-friendly reaction format 22 that is both human- and machine-readable. This approach will significantly contribute to the development of extensive databases like the Open Reaction Database, 23,24 SciFinder 25 and Reaxys, 26 which gather comprehensive synthesis data through automated curation and expert verification, to make data more findable, accessible, interoperable, and reusable (FAIR).…”
Section: Resultsmentioning
confidence: 99%
“…Apart from the five mentioned extraction tasks, it can be easily extended to tasks related to extracting information from scientific literature and transforming data into a simple user-friendly reaction format 22 that is both human- and machine-readable. This approach will significantly contribute to the development of extensive databases like the Open Reaction Database, 23,24 SciFinder 25 and Reaxys, 26 which gather comprehensive synthesis data through automated curation and expert verification, to make data more findable, accessible, interoperable, and reusable (FAIR).…”
Section: Resultsmentioning
confidence: 99%
“…In order to support the workflow but also store the data in a way that allows conversion into files amenable to machine learning, for example, the simple user-friendly reaction format (SURF), we developed a hierarchical schema to organize our data (Figure ). We clustered the data fields around the topics of Chemistry, Analytics, and Experiments.…”
Section: Hte Os-behind the Facadementioning
confidence: 99%
“…20,21 In addition, HTE generates large and consistent data sets covering both positive and, importantly, negative reaction outcomes for chemical transformations. 22,23 In line with the large amount of data generated, addressing the challenge of data sharing, adopting standardized formats and employing both human-and machine-readable reaction data formats 24 has become crucial. Repositories such as the Open Reaction Database (ORD) 25 and the Unified Data Model (UDM) 26 have already improved access to diverse data sets of reactions.…”
Section: Introductionmentioning
confidence: 99%