A process has been developed whereby libraries of compounds for lead optimization can be synthesized and screened with greater efficiency using computational tools. In this method, analogues of a lead chemical structure are considered in the form of a virtual library. Less than 1/3 of the library is selected as a training set by clustering the compounds and choosing the centroid of each cluster. This training set is then used to generate a model using PLS regression upon the experimental values from that assay using 1D/2D descriptors. The model is applied to the remaining compounds (the test set) for which assay values are predicted and a rank ordering established. An example of this was a set of 169 PDE4 inhibitors. A predictive model was achieved using a training set of 52 compounds. When applied to the remaining 117 compounds this model allowed a rank ordering of these compounds for synthesis and testing. Selecting the top 33 compounds of the test set gives 78% of the compounds with the desired activity (hits) by synthesizing only 50% of the library, including the training set. Selecting the top 59 of the test set gives 97% of the hits from only 67% of the library. This process succeeds by avoiding two principal weaknesses of 2D descriptors: lack of interpretation and lack of extrapolation. Two principal assumptions of QSAR are shown to be unnecessary; removing descriptor redundancy does not improve fit and a predictive r2 greater than 0.5 is not necessary if rank-ordering is desired.
There are many decisions and risks associated with the design and development of new pharmaceutical agents. To help improve decision-making, and reduce the associated risks--prior to synthesis, we have developed interactive web-browser tools for: (i) tracking, searching, clustering and categorizing (by reactive moieties) chemical reactants, (ii) interactively assessing risks, either synthetic--based on prior experience, absorption following oral administration--based on rules of 5, or diversity, and (iii) a complete architecture for enumerating, analyzing, submitting and plating large combinatorial or small biased libraries. We believe the implementation of this highly interactive system has given our scientists a competitive advantage by maintaining their focus on the lowest risk, highest quality molecules throughout the research process.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.