2021
DOI: 10.1021/acscentsci.1c00535
|View full text |Cite
|
Sign up to set email alerts
|

The Evolution of Data-Driven Modeling in Organic Chemistry

Abstract: Organic chemistry is replete with complex relationships: for example, how a reactant’s structure relates to the resulting product formed; how reaction conditions relate to yield; how a catalyst’s structure relates to enantioselectivity. Questions like these are at the foundation of understanding reactivity and developing novel and improved reactions. An approach to probing these questions that is both longstanding and contemporary is data-driven modeling. Here, we provide a synopsis of the history of data-driv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
86
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 93 publications
(96 citation statements)
references
References 136 publications
(246 reference statements)
0
86
0
Order By: Relevance
“…9,10 Computational approaches help in identifying structural aspects pertinent to reactivity and inform the design of improved catalysts. [11][12][13][14][15][16] A strategy popularized by Sigman and co-workers is to correlate physical organic descriptors to experimental activity or selectivity outcomes via multivariate regression analysis. [17][18][19][20][21][22][23][24][25] not fully exploit the fragment-based nature of organocatalysts and has limited transferability because, when even small modifications on part of the catalyst are made, its entire structure must be re-optimized and the parameters collected.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…9,10 Computational approaches help in identifying structural aspects pertinent to reactivity and inform the design of improved catalysts. [11][12][13][14][15][16] A strategy popularized by Sigman and co-workers is to correlate physical organic descriptors to experimental activity or selectivity outcomes via multivariate regression analysis. [17][18][19][20][21][22][23][24][25] not fully exploit the fragment-based nature of organocatalysts and has limited transferability because, when even small modifications on part of the catalyst are made, its entire structure must be re-optimized and the parameters collected.…”
Section: Introductionmentioning
confidence: 99%
“…9,10 Computational approaches help in identifying structural aspects pertinent to reactivity and inform the design of improved catalysts. 11–16 A strategy popularized by Sigman and co-workers is to correlate physical organic descriptors to experimental activity or selectivity outcomes via multivariate regression analysis. 17–25 When applied to organocatalysts, the descriptors are typically evaluated on the catalyst structure as a whole or, occasionally, from truncated versions of it.…”
Section: Introductionmentioning
confidence: 99%
“…3B, while HTE data cover a narrow and homogeneous chemical space (Fig. 2A and 1A-B) and are devoid of experimental and reporting bias 13,24,31,32 . Unlike published reaction data, HTE systematically reports yields for all reactions including low yields, and is comprised of reactions performed in the same experimental settings.…”
Section: Benchmark Of Machine Learning Models On Nicolitmentioning
confidence: 99%
“…Machine learning-based data-driven approaches have attracted tremendous interest recently due to their potential to change the conventional reaction development processes. 1 Although classification 2 and clustering 3 techniques have been applied to analyze molecular catalysis/reactivities of transition metal complexes, regression analysis between reaction outcomes ( e.g. , enantioselectivity) and molecular descriptors is one of the central foci in data-driven approaches for the design and optimization of molecular catalysis.…”
Section: Introductionmentioning
confidence: 99%