2021
DOI: 10.26434/chemrxiv.14346920.v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Kernel Methods for Predicting Yields of Chemical Reactions

Abstract: The use of machine learning methods for the prediction of reaction yield is an emerging area. We demonstrate the applicability of support vector regression (SVR) for predicting reaction yields, using combinatorial data. Molecular descriptors used in regression tasks related to chemical reac?tivity have often been based on time-consuming, computationally demanding quantum chemical calculations, usually density functional theory. Structure-based descriptors (molecular fingerprints and molecular graphs) are quick… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 59 publications
0
8
0
Order By: Relevance
“…The concept of a domain of applicability is well-established in quantitative structure− activity relationship (QSAR) models 80 and has previously been reported in reaction informatics. 81 Generally, the uncertainty in a particular prediction is correlated to the similarity between that molecule and the molecules used to construct the model. Using Euclidean distance in the chemical space to quantify the domain of applicability (see the Supporting Information for details on distance calculation), ligands tested that were 0.5 normalized parameter space units or less from a training set ligand had a prediction mean absolute error (MAE) of 0.24 kcal/ mol for the Hayashi−Heck reaction (Figure 10, graph 2 and graph 3).…”
Section: ■ Results and Discussionmentioning
confidence: 99%
“…The concept of a domain of applicability is well-established in quantitative structure− activity relationship (QSAR) models 80 and has previously been reported in reaction informatics. 81 Generally, the uncertainty in a particular prediction is correlated to the similarity between that molecule and the molecules used to construct the model. Using Euclidean distance in the chemical space to quantify the domain of applicability (see the Supporting Information for details on distance calculation), ligands tested that were 0.5 normalized parameter space units or less from a training set ligand had a prediction mean absolute error (MAE) of 0.24 kcal/ mol for the Hayashi−Heck reaction (Figure 10, graph 2 and graph 3).…”
Section: ■ Results and Discussionmentioning
confidence: 99%
“…This model combined with data augmentation techniques showed improved robustness in the low‐data regime and allowed the estimation of the prediction uncertainty 134 . Haywood et al 135 further analyzed the different representations and compared molecular fingerprints with computed descriptors using support vector machine (SVM) and found fingerprints to give better results.…”
Section: Chemical Reaction Tasksmentioning
confidence: 99%
“…The hyperplane is completely dened by the data points that are closest to the plane and between the support vectors. SVM can also be used in mapping the non-separable data through the radial basis function (RBF) kernel by transforming a real space into a higher-dimensional space through several hyperplanes: 33,34 f ðxÞ ¼…”
Section: Machine Learning (Ml) Algorithmsmentioning
confidence: 99%