From quantum chemical and experimental NMR data, a 3D graph neural network, CASCADE, has been developed to predict carbon and proton chemical shifts. Stereoisomers and conformers of organic molecules can be correctly distinguished.
Conspectus Machine-readable chemical structure representations are foundational in all attempts to harness machine learning for the prediction of reactivities, selectivities, and chemical properties directly from molecular structure. The featurization of discrete chemical structures into a continuous vector space is a critical phase undertaken before model selection, and the development of new ways to quantitatively encode molecules is an active area of research. In this Account, we highlight the application and suitability of different representations, from expert-guided “engineered” descriptors to automatically “learned” features, in different prediction tasks relevant to organic and organometallic chemistry, where differing amounts of training data are available. These tasks include statistical models of stereo- and enantioselectivity, thermochemistry, and kinetics developed using experimental and quantum chemical data. The use of expert-guided molecular descriptors provides an opportunity to incorporate chemical knowledge, domain expertise, and physical constraints into statistical modeling. In applications to stereoselective organic and organometallic catalysis, where data sets may be relatively small and 3D-geometries and conformations play an important role, mechanistically informed features can be used successfully to obtain predictive statistical models that are also chemically interpretable. We provide an overview of several recent applications of this approach to obtain quantitative models for reactivity and selectivity, where topological descriptors, quantum mechanical calculations of electronic and steric properties, along with conformational ensembles, all feature as essential ingredients of the molecular representations used. Alternatively, more flexible, general-purpose molecular representations such as attributed molecular graphs can be used with machine learning approaches to learn the complex relationship between a structure and prediction target. This approach has the potential to out-perform more traditional representation methods such as “hand-crafted” molecular descriptors, particularly as data set sizes grow. One area where this is particularly relevant is in the use of large sets of quantum mechanical data to train quantitative structure–property relationships. A general approach toward curating useful data sets and training highly accurate graph neural network models is discussed in the context of organic bond dissociation enthalpies, where this strategy outperforms regression using precomputed descriptors. Finally, we describe how graph neural network predictions can be incorporated into mechanistically informed statistical models of chemical reactivity and selectivity. Once trained, this approach avoids the expensive computational overhead associated with quantum mechanical calculations, while maintaining chemical interpretability. We illustrate examples for which fast predictions of bond dissociation enthalpy and of the identities of radicals formed through cleavage of a molecule’s we...
We report that O-selective arylation of 2-and 4-pyridones with arylboronic acids is affected by a modular, bismacycle-based system. The utility of this umpolung approach to pyridyl ethers, which is complementary to conventional methods based on SNAr or crosscoupling, is demonstrated through the concise synthesis of Ki6783 and picolinafen, and the formal synthesis of cabozantib and golvatinib. Computational investigations reveal that arylation proceeds in a concerted fashion via a 5-membered transition state. The kineticallycontrolled regioselectivity for O-arylationwhich is reversed relative to previous Bi(V)-mediated pyridone arylationsis attributed primarily to the geometric constraints imposed by the bismacyclic scaffold.
The importance of modified peptides and proteins for applications in drug discovery, and for illuminating biological processes at the molecular level, is fueling a demand for efficient methods that facilitate the precise modification of these biomolecules. Herein, we describe the development of a photocatalytic method for the rapid and efficient dimerization and site-specific functionalization of peptide and protein diselenides. This methodology, dubbed the photocatalytic diselenide contraction, involves irradiation at 450 nm in the presence of an iridium photocatalyst and a phosphine and results in rapid and clean conversion of diselenides to reductively stable selenoethers. A mechanism for this photocatalytic transformation is proposed, which is supported by photoluminescence spectroscopy and density functional theory calculations. The utility of the photocatalytic diselenide contraction transformation is highlighted through the dimerization of selenopeptides, and by the generation of two families of protein conjugates via the site-selective modification of calmodulin containing the 21st amino acid selenocysteine, and the C-terminal modification of a ubiquitin diselenide.
The catalytic anti-Markovnikov addition of alcohols to simple alkenes is a longstanding synthetic challenge. We recently disclosed the use of organic superbase catalysis for the nucleophilic addition of alcohols to activated styrene derivatives. This article describes mechanistic studies on this reversible reaction, including thermodynamic and kinetic profiling as well as computational modeling. Our findings show the negative entropy of addition is counterbalanced by an enthalpy that is most favored in nonpolar solvents. However, a large negative alcohol rate order under these conditions indicates excess alcohol sequesters the active alkoxide ion pairs, slowing the reaction rate. These observations led to an unexpected solution to a thermodynamically challenging reaction: use of less alcohol enables faster addition, which in turn allows for lower reaction temperatures to counteract Le Chatelier’s principle. Thus, our original method has been improved with new protocols that do not require excess alcohol stoichiometry, enable an expanded alkene substrate scope, and allow for the use of more practical catalyst systems. The generality of this insight for other challenging hydroetherification reactions is also demonstrated through new alkenol cyclization and oxa-Michael addition reactions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.