Microcrystal electron diffraction (MicroED) has established its complementary role alongside X-ray diffraction in crystal structure elucidation. Unfortunately, kinematical refinement of MicroED data lacks the differentiation power to assign the absolute...
The absolute configuration of a clinically important drug candidate, SMTP-7, with only micron-sized powders available, is directly obtained via microcrystal electron diffraction (MicroED) analysis.
Drug Design Data Resource (D3R) Grand Challenge 4 (GC4) offered a unique opportunity to design and test novel methodology for accurate docking and affinity prediction of ligands in an open and blinded manner. We participated in the beta-secretase 1 (BACE) Subchallenge, which comprised cross-docking and redocking of 20 macrocyclic ligands to BACE and predicting binding affinity for 154 macrocyclic ligands. For this challenge, we developed machine learning models trained specifically on BACE. We developed a deep neural network (DNN) model that combined structure-based and ligand-based features and outperformed simpler machine learning models. According to the results released by D3R, we achieved a Spearman's rank correlation coefficient of 0.43 (7) for predicting the affinity of 154 ligands. We describe the formulation of our machine learning strategy in detail. We compared the performance of the DNN with linear regression, random forest, and support vector machines using ligand-based features, structure-based features, and a combination of both. We compared different network structures for our DNN and found that performance was highly dependent on fine optimization of the regularization hyperparameter, alpha. We also developed a novel metric of ligand three-dimensional similarity, inspired by crystallographic difference density maps, to match ligands without crystal structures to similar ligands with known crystal structures. This report demonstrates the detailed parameterization and careful data training and implementation necessary to obtain strong performance with more complex machine learning methods. Our DNN approach tied for fourth in predicting BACE-ligand binding affinities.
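For readers unfamiliar with the evaluation metric quoted above, the following is a minimal sketch of how a Spearman's rank correlation coefficient can be computed between predicted and measured affinities. It is not the D3R scoring code or the authors' implementation. The function names `average_ranks` and `spearman` and the example values are hypothetical, chosen only for illustration, and ties are assumed to be handled by average ranking.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with the one at position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(vx * vy)


if __name__ == "__main__":
    # Hypothetical affinities (illustrative only, not challenge data).
    predicted = [6.1, 7.4, 5.2, 8.0, 6.9]
    measured = [5.8, 7.9, 6.0, 7.5, 6.5]
    print(f"Spearman rho = {spearman(predicted, measured):.2f}")
```

Because the coefficient depends only on rank order, it rewards a model for ordering ligands correctly even when the predicted affinity values themselves are offset or scaled, which is why it suits a blinded ranking challenge.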