2023
DOI: 10.1002/cmtd.202200062
|View full text |Cite
|
Sign up to set email alerts
|

Reaction Impurity Prediction using a Data Mining Approach**

Abstract: Automated prediction of reaction impurities is useful in early‐stage reaction development, synthesis planning and optimization. Existing reaction predictors are catered towards main product prediction, and are often black‐box, making it difficult to troubleshoot erroneous outcomes. This work aims to present an automated, interpretable impurity prediction workflow based on data mining large chemical reaction databases. A 14‐step workflow was implemented in Python and RDKit using Reaxys® data. Evaluation of pote… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 43 publications
(84 reference statements)
0
4
0
Order By: Relevance
“…The Heuristic Step: ChemBalancer. ChemBalancer was partially adapted from Arun et al's balancing algorithm, 14 which was originally proposed as a step to identify chemical impurities produced in reactions. The workflow of Chem-Balancer is shown and summarized in Figure 2.…”
Section: ■ Methodsmentioning
confidence: 99%
“…The Heuristic Step: ChemBalancer. ChemBalancer was partially adapted from Arun et al's balancing algorithm, 14 which was originally proposed as a step to identify chemical impurities produced in reactions. The workflow of Chem-Balancer is shown and summarized in Figure 2.…”
Section: ■ Methodsmentioning
confidence: 99%
“…Furthermore, the interpretability and transparency of data mining results are crucial for building trust among stakeholders and understanding complex data analyses. The challenges of data quality and preprocessing are also significant, as inadequate data handling can significantly impact data mining outcomes [ 310 , 311 , 312 , 313 , 314 , 315 , 316 ].…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, the interpretability and transparency of data mining results are crucial for building trust among stakeholders and understanding complex data analyses. The challenges of data quality and preprocessing are also significant, as inadequate data handling can significantly impact data mining outcomes [72][73][74][75][76][77][78].…”
Section: Introductionmentioning
confidence: 99%