2018
DOI: 10.1186/s13321-018-0265-z
|View full text |Cite
|
Sign up to set email alerts
|

Efficient iterative virtual screening with Apache Spark and conformal prediction

Abstract: BackgroundDocking and scoring large libraries of ligands against target proteins forms the basis of structure-based virtual screening. The problem is trivially parallelizable, and calculations are generally carried out on computer clusters or on large workstations in a brute force manner, by docking and scoring all available ligands.ContributionIn this study we propose a strategy that is based on iteratively docking a set of ligands to form a training set, training a ligand-based model on this set, and predict… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
36
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 35 publications
(36 citation statements)
references
References 28 publications
(32 reference statements)
0
36
0
Order By: Relevance
“…However, what compounds should be picked at each iteration still remains a largely unresolved question. Hence, learning patterns in the screening data using artificial intelligence to increase hit rates, and hence discover active molecules faster and more efficiently, is gaining increasing attention [19][20][21][22] .…”
Section: Introductionmentioning
confidence: 99%
“…However, what compounds should be picked at each iteration still remains a largely unresolved question. Hence, learning patterns in the screening data using artificial intelligence to increase hit rates, and hence discover active molecules faster and more efficiently, is gaining increasing attention [19][20][21][22] .…”
Section: Introductionmentioning
confidence: 99%
“…Such new services (comprising models for new receptors) can be deployed in a similar way as shown for the reference instance on Kubernetes (code and instructions available on [41]). In the supplementary material we show how users can build models using our previous method [37] and then use the models to create service for a new receptor. Instructions are provided to deploy and add the Docker container for a new receptor to the service [39].…”
Section: Discussionmentioning
confidence: 99%
“…To confront this challenge, back in 2006 we introduced Progressive Docking -a hybrid docking/ machine learning approach utilizing 3D 'inductive' QSAR (quantitative structure-activity relationship) descriptors [8][9][10] to filter out molecules predicted to have unfavorable Glide docking scores 11 . While this method resulted in 3-4 fold enrichment of virtual hits (as several similar approaches reported more recently 12,13 ), the Progressive Docking did not gain a particular momentum. For that we could contemplate two possible reasons -on one hand, back in 2006 the ZINC database contained only ~1 million entries, which were amendable to conventional docking.…”
mentioning
confidence: 88%