2018
DOI: 10.1021/acs.jcim.8b00376
|View full text |Cite
|
Sign up to set email alerts
|

Discovering Highly Potent Molecules from an Initial Set of Inactives Using Iterative Screening

Abstract: The versatility of similarity searching and quantitative structure-activity relationships to model the activity of compound sets within given bioactivity ranges (i.e., interpolation) is well established. However, their relative performance in the common scenario in early stage drug discovery where lots of inactive data but no active data points are available (i.e., extrapolation from the low-activity to the high-activity range) has not been thoroughly examined yet. To this aim, we have designed an iterative vi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
41
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 30 publications
(44 citation statements)
references
References 80 publications
3
41
0
Order By: Relevance
“…We are also happy to clarify that the errors bars in Figs. 3 and 4, and the ± values indicated in the text or in the tables all correspond to the standard deviation over the relevant population, consistent with standard practice [5, 6].…”
Section: In-depth Commentssupporting
confidence: 61%
“…We are also happy to clarify that the errors bars in Figs. 3 and 4, and the ± values indicated in the text or in the tables all correspond to the standard deviation over the relevant population, consistent with standard practice [5, 6].…”
Section: In-depth Commentssupporting
confidence: 61%
“…We firstly compared the performance on the test set of DNN, using dropout probabilities in all layers of either 0.1, 0.25 or 0.5, and RF models ( Figure 2 in line with models reported in the literature for similar data sets 38 . Hence, the models obtained here are likely approaching the upper performance limit which can be obtained for the datasets used, which is also a likely factor behind the very similar performance obtained across methods.…”
Section: Resultsmentioning
confidence: 99%
“…Hence, the model is used to identify data points that lead to the highest information gain (exploration) as opposed to identify newly active data points (exploitation). Previously this approach was shown to lead to a quick improvement in biological activity . Accordingly, using the current and a public dataset, a first PCM SGLT1 screening model was developed that effectively predicted moderately active SGLT1 inhibitors outside the chemical space of the training set .…”
Section: Discussionmentioning
confidence: 99%