Reaction databases provide a great deal of useful information to assist planning of experiments but do not provide any interpretation or chemical concepts to accompany this information. In this work, reactions are labeled with experimental conditions, and network analysis shows that consistencies within clusters of data points can be leveraged to organize this information. In particular, this analysis shows how particular experimental conditions (specifically solvent) are effective in enabling specific organic reactions (Friedel−Crafts, Aldol addition, Claisen condensation, Diels−Alder, and Wittig), including variations within each reaction class. Network analysis shows data points for reactions tend to break into clusters that depend on the catalyst and chemical structure. This type of clustering, which mimics how a chemist reasons, is derived directly from the network. Therefore, the findings of this work could augment synthesis planning by providing predictions in a fashion that mimics human chemists. To numerically evaluate solvent prediction ability, three methods are compared: network analysis (through the k-nearest neighbor algorithm), a support vector machine, and a deep neural network. The most accurate method in 4 of the 5 test cases is the network analysis, with deep neural networks also showing good prediction scores. The network analysis tool was evaluated by an expert panel of chemists, who generally agreed that the algorithm produced accurate solvent choices while simultaneously being transparent in the underlying reasons for its predictions.
Conical intersections (CIs) are important features of photochemistry that determine yields and selectivity. Traditional CI optimizers require significant human effort and chemical intuition, which typically restricts searching to only a small region of the CI space. Herein, a systematic approach utilizing the growing string method is introduced to locate multiple CIs. Unintuitive MECI are found using driving coordinates that can be generated using a combinatorial search, and subsequent optimization allows reaction pathways, transition states, products, and seam-space pathways to be located. These capabilities are demonstrated by application to two prototypical photoisomerization reactions and the dimerization of butadiene. In total, many reaction pathways were uncovered, including the elusive stilbene hula-twist mechanism, and a previously unidentified product in butadiene dimerization. Overall, these results suggest that growing string methods provide a predictive strategy for exploring photochemistry.
Transfer and active learning have the potential to accelerate the development of new chemical reactions, using prior data and new experiments to inform models that adapt to the target area...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.