2022
DOI: 10.1021/acscatal.2c01741
|View full text |Cite
|
Sign up to set email alerts
|

Mechanistic Inference from Statistical Models at Different Data-Size Regimes

Abstract: The chemical sciences are witnessing an influx of statistics into the catalysis literature. These developments are propelled by modern technological advancements that are leading to fast and reliable data production, mining, and management. In organic chemistry, models encoded with information-rich parameters have facilitated the formulation of mechanistic hypotheses across different data-size regimes. Herein, we aim to demonstrate through selected examples that the integration of statistical principles into h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(18 citation statements)
references
References 229 publications
0
18
0
Order By: Relevance
“…Not only do the reactions use different transition metals (i.e., Pd vs Rh), but the coordination geometries of putative reaction intermediates are also distinct. While the Heck reaction primarily involves square-planar Pd species, the hydroformylation reaction is thought to proceed through trigonal bipyramidal Rh intermediates …”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Not only do the reactions use different transition metals (i.e., Pd vs Rh), but the coordination geometries of putative reaction intermediates are also distinct. While the Heck reaction primarily involves square-planar Pd species, the hydroformylation reaction is thought to proceed through trigonal bipyramidal Rh intermediates …”
Section: Resultsmentioning
confidence: 99%
“…As showcased below, the designer maps can be applied to various downstream data science steps in multi-objective optimization campaigns. In addition to the reaction optimization applications presented (vide infra), it can be envisioned that this calculated parameter library and chemical space map could be used for many other potential applications such as training set design, novel ligand generation, and mechanistic understanding …”
Section: Resultsmentioning
confidence: 99%
“…This data-size regime engenders fundamental restrictions in ML. [13] For example, many assumptions that are valid with big data, such as subtracting the mean and dividing by the standard deviation, tend to create artifacts. Familiarity with these issues defines the space of permissible statistical tools.…”
Section: Variance and Biasmentioning
confidence: 99%
“…In addition, the high-quality data sets generated by HTE facilitate the use of statistical modeling techniques. Such tools can be leveraged to interrogate which features of the ligand dictate reactivity and can thus inform the design of future ligands. Despite these advantages, it is relatively uncommon for C–N cross-coupling methods to be optimized for HTE compatibility, which may require the use of high boiling, polar solvents, and relatively dilute conditions (0.1 M or less).…”
Section: Introductionmentioning
confidence: 99%