Synergy Between Expert and Machine‐Learning Approaches Allows for Improved Retrosynthetic Planning

Badowski, Tomasz; Gajewska, Ewa; Molga, Karol; Grzybowski, Bartosz A.

doi:10.1002/ange.201912083

Cited by 18 publications

(12 citation statements)

References 24 publications

(44 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…[21][22][23][24][25] Significant effort has also been invested in computer-aided synthesis planning (CASP) 26 and the development of retrosynthetic design algorithms. [27][28][29][30] To supplement these tools, initial attempts have been made to predict reaction conditions in the forward direction based on the substrates and products involved. 31 Thus far, studies have focused on global datasets with millions of data points of mixed reaction types.…”

Section: Introductionmentioning

confidence: 99%

Multilabel Classification Models for the Prediction of Cross-Coupling Reaction Conditions

Maser

Cui

Ryou

et al. 2021

J. Chem. Inf. Model.

View full text Add to dashboard Cite

Machine-learned ranking models have been developed for the prediction of substrate-specific cross-coupling reaction conditions. Datasets of published reactions were curated for Suzuki, Negishi, and C-N couplings, as well as Pauson-Khand reactions. String, descriptor, and graph encodings were tested as input representations, and models were trained to predict the set of conditions used in a reaction as a binary vector.Unique reagent dictionaries categorized by expert-crafted reaction roles were constructed for each dataset, leading to context-aware predictions. We find that relational graph convolutional networks and gradient-boosting machines are very effective for this learning task, and we disclose a novel reaction-level graph-attention operation in the top-performing model. File list (2)download file view on ChemRxiv 2020-10-13_ChemRxiv.pdf (2.25 MiB) download file view on ChemRxiv 2020-10-13_ChemRxiv_SI.pdf (3.28 MiB)

show abstract

Section: Introductionmentioning

confidence: 99%

Multilabel Classification Models for the Prediction of Cross-Coupling Reaction Conditions

Maser

Cui

Ryou

et al. 2021

J. Chem. Inf. Model.

View full text Add to dashboard Cite

show abstract

“…The composition-dependent kinetics and thermodynamics limit the design of thermal processing protocols to obtain the desired thin film composition and phase. Models for predicting synthesis protocols are being developed, for example via machine learning from the literature for solid state materials 13 and combining machine learning with human insights 14 and symbolic AI 15 for molecular synthesis. While these powerful approaches will undoubtedly be useful in specific settings, the proposed synthesis of thin film oxysulfide alloys is not amenable to synthesis design by such models at this time, due to the lack of relevant training data, both with respect to the La-Bi-Cu oxysulfide chemistry and with respect to the thin film format of complex oxysulfides.…”

Section: Introductionmentioning

confidence: 99%

Combinatorial Synthesis of Oxysulfides in the Lanthanum–Bismuth-Copper System

et al. 2020

View full text Add to dashboard Cite

Establishing synthesis methods for a target material comprises a grand challenge in materials research, which is compounded with use-inspired specifications on the format of the material. Solar photochemistry using thin film materials is a promising technology for which many complex materials are being proposed, and the present work describes application of combinatorial methods to explore the synthesis of predicted La-Bi-Cu oxysulfide photocathodes, in particular alloys of LaCuOS and BiCuOS. The variation in concentration of 3 cations and 2 anions in thin film materials, and crystallization thereof, is achieved by a combination of reactive sputtering and thermal processes including reactive annealing and rapid thermal processing. Composition and structural characterization establish composition-processing-structure relationships that highlight the breadth of processing conditions required for synthesis of LaCuOS and BiCuOS. The relative irreducibility of La oxides and limited diffusion indicate the need for high temperature processing, which conflicts with the temperature limits for mitigating evaporation of Bi and S. Collectively the results indicate that alloys of these phases will require reactive annealing protocols that are uniquely tailored to each composition, motivating advancement of dynamic processing capabilities to further automate discovery of synthesis routes.

show abstract

“…In addition, the failure of the worldwide logistics and supply chains that accompanies COVID-19 pandemic might render some key substrates temporarily unavailable, in effect delaying execution of the proven synthetic routes and calling for alternative synthetic solutions. Anticipating such complications, we harnessed the power of Chematica [8][9][10][11][12][13][14][15][16] -an experimentally-tested 9,10 platform for computer-assisted retrosynthesis of both known and unknown target molecules -to design syntheses of HCQ that would (1) commence from various inexpensive and popular starting materials (so that the syntheses minimize the abovementioned supply problems); (2) circumvent patented methodologies whenever possible 16 ; and (3) minimize the use of expensive methodologies and/or reagents. In the following, we briefly outline the computational methods underlying Chematica's retrosynthetic searches, summarize the known syntheses of HCQ, and then describe novel ones identified by Chematica to meet conditions (1)- (3).…”

mentioning

confidence: 99%

“…Chematica is a sophisticated platform for fully automated design of pathways leading to arbitrary (i.e., both known and new) targets. The software combines elements of network theory 16,17 with an expert knowledge-base of synthetic transformations as well as multiple reaction-evaluation routines (based on machine learning, 11,12 quantum mechanics, 8,9 and molecular dynamics 9,13 ) to search over vast trees of synthetic possibilities. The reaction transforms (currently, ~ 100,000) are expert-coded based on the underlying reaction mechanisms and are broader than any specific literature precedents (for comparison with machine extraction of rules from reaction repositories, see 13 ).…”

mentioning

confidence: 99%

“…Each rule specifies the scope of admissible substituents, accounts for stereo-and regiochemistry requirements, recognizes groups that must be protected under given reaction conditions, and identifies functionalities that are outright incompatible. The searches are guided by combinations of functions (either heuristic 8,9 or best-in-class AI-based 12 ) that score both synthetic positions as well as costs of individual reactions. The pathways identified by the program terminate in either commercially available chemicals (here, more than 200,000 molecules from Sigma-Aldrich catalogs, each with price per unit quantity; also see below for price re-scaling) or those already known in the literature (ca.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Computer-Assisted Planning of Hydroxychloroquine’s Syntheses Commencing from Inexpensive Substrates and Bypassing Patented Routes.

Szymkuć¹,

Gajewska²,

Molga³

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

A computer program for retrosynthetic planning helps develop multiple "synthetic contingency" plans for hydroxychloroquine, a promising but yet unproven medication against COVID-19. These plans are designed to navigate, as much as possible, around known and patented routes and to commence from inexpensive and diverse starting materials, such as to ensure supply in case of anticipated market shortages of the commonly used substrates.

show abstract

Synergy Between Expert and Machine‐Learning Approaches Allows for Improved Retrosynthetic Planning

Cited by 18 publications

References 24 publications

Multilabel Classification Models for the Prediction of Cross-Coupling Reaction Conditions

Multilabel Classification Models for the Prediction of Cross-Coupling Reaction Conditions

Combinatorial Synthesis of Oxysulfides in the Lanthanum–Bismuth-Copper System

Computer-Assisted Planning of Hydroxychloroquine’s Syntheses Commencing from Inexpensive Substrates and Bypassing Patented Routes.

Contact Info

Product

Resources

About