2020
DOI: 10.26434/chemrxiv.13012679
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Atom-to-Atom Mapping: A Benchmarking Study of Popular Mapping Algorithms and Consensus Strategies

Abstract: Here, we discuss a reaction standardization protocol followed by a comparison of popular Atom-to-atom mapping (AAM) tools (ChemAxon, Indigo, RDTool, NextMove and RXNMapper) as well as some consensus AAM strategies. For this purpose, a dataset of 1851 manually curated and mapped reactions was prepared (the Golden dataset) and used as a reference set. It has been found that RXNMapper possesses the highest accuracy, despite the fact that it has some clear disadvantages. Finally, RXNMapper was selected as the best… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

1
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 9 publications
1
3
0
Order By: Relevance
“…For 86% (4706) of these reactions, the mappings are fully consistent (Figure 3b), with all atom pairs being consistent between MetaCyc and the results of RDT. This is similar to the reported findings from other comparative analyses (Preciat Gonzalez et al., 2017; Madzhidov et al., 2020, Figure 3a). The remaining 14% (760) of reactions showed inconsistencies between MetaCyc and the results of RDT, which are due to the different algorithms for their generation.…”
Section: Resultssupporting
confidence: 93%
See 2 more Smart Citations
“…For 86% (4706) of these reactions, the mappings are fully consistent (Figure 3b), with all atom pairs being consistent between MetaCyc and the results of RDT. This is similar to the reported findings from other comparative analyses (Preciat Gonzalez et al., 2017; Madzhidov et al., 2020, Figure 3a). The remaining 14% (760) of reactions showed inconsistencies between MetaCyc and the results of RDT, which are due to the different algorithms for their generation.…”
Section: Resultssupporting
confidence: 93%
“…RDT, in its original implementation, is not designed to allow multiple mappings; recently Madzhidov et al. (2020) created an extended version of RDT, featuring this functionality. Usage of this new RDT version in the proposed workflow is possible and will be attempted in future extensions.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In the last years, the use of machine learning and data-driven approaches proved to be a very effective way to capture patterns from complex chemistry knowledge collections. 13 The extraction of chemical reaction rules from large data sets of traditional organic chemistry reactions 14 is one of the most successful examples 15 of providing transparency and explainability with AI applications in chemistry. While traditional synthetic organic chemistry went through its renaissance period thanks to recent development in machine learning and the availability of public chemical reaction datasets, the impact in biochemistry remained mostly bounded to the context of metabolic pathways prediction.…”
Section: Introductionmentioning
confidence: 99%