VirtualFlow Ants—Ultra-Large Virtual Screenings with Artificial Intelligence Driven Docking Algorithm Based on Ant Colony Optimization

Gorgulla, Christoph; Çınaroğlu, Süleyman Selim; Fischer, Patrick D.; Fackeldey, Konstantin; Wagner, Gerhard; Arthanari, Haribabu

doi:10.3390/ijms22115807

Cited by 25 publications

(18 citation statements)

References 58 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The first such screen was performed by OpenEye Scientific on their Orion cloud computing platform, leveraging vast quantities of CPUs to explicitly predict the pose and score of each compound (detailed experimental outcomes were not provided). Several billion compound screens have been performed with VirtualFlow, a workflow that relies on less-expensive, and arguably less-accurate, methods to triage compounds. ,, Docking an ultralarge library of one billion compounds on 1000 CPUs would take ∼11 days with DOCK3.7 (at 1 s/lig), 300 days with Glide SP (at 30 s/lig), and 173 days with VirtualFlow (at 15 s/lig). This does not account for the cost to prepare the library for screening, which typically includes the generation of low-energy ionization and tautomeric states, as well as stereochemical states for unspecified centers.…”

Section: Introductionmentioning

confidence: 99%

Efficient Exploration of Chemical Space with Docking and Deep Learning

Yang

Yao

Repasky

et al. 2021

J. Chem. Theory Comput.

120

111

View full text Add to dashboard Cite

With the advent of make-on-demand commercial libraries, the number of purchasable compounds available for virtual screening and assay has grown explosively in recent years, with several libraries eclipsing one billion compounds. Today’s screening libraries are larger and more diverse, enabling the discovery of more-potent hit compounds and unlocking new areas of chemical space, represented by new core scaffolds. Applying physics-based in silico screening methods in an exhaustive manner, where every molecule in the library must be enumerated and evaluated independently, is increasingly cost-prohibitive. Here, we introduce a protocol for machine learning-enhanced molecular docking based on active learning to dramatically increase throughput over traditional docking. We leverage a novel selection protocol that strikes a balance between two objectives: (1) identifying the best scoring compounds and (2) exploring a large region of chemical space, demonstrating superior performance compared to a purely greedy approach. Together with automated redocking of the top compounds, this method captures almost all the high scoring scaffolds in the library found by exhaustive docking. This protocol is applied to our recent virtual screening campaigns against the D4 and AMPC targets that produced dozens of highly potent, novel inhibitors, and a blind test against the MT1 target. Our protocol recovers more than 80% of the experimentally confirmed hits with a 14-fold reduction in compute cost, and more than 90% of the hit scaffolds in the top 5% of model predictions, preserving the diversity of the experimentally confirmed hit compounds.

show abstract

Section: Introductionmentioning

confidence: 99%

Efficient Exploration of Chemical Space with Docking and Deep Learning

Yang

Yao

Repasky

et al. 2021

J. Chem. Theory Comput.

120

111

View full text Add to dashboard Cite

show abstract

“…Software. Software platforms, such as VirtualFlow (3,19,20), that can screen ultra-large ligand libraries using the above-mentioned computational resources, including cloud computing infrastructure, became freely available.…”

Section: Docking-based Ultra-large Virtual Screeningsmentioning

confidence: 99%

“…One of the features of VFVS is that it supports a variety of external docking programs, such as AutoDock Vina (50), Smina (51), QuickVina 2 (52), QuickVina-W (53), Vina-Carb (54), or VinaXB (55). Later GWO Vina and PLANTS were added (19,20). Most of these docking programs have special features, such as blind-docking capabilities (QuickVina-W), or enhanced accuracy for docking carbohydrates (Vina-Carb).…”

Section: Vfvs Vfvsmentioning

confidence: 99%

Recent Developments in Structure-Based Virtual Screening Approaches

Gorgulla¹

2022

Preprint

View full text Add to dashboard Cite

Drug development is a wide scientific field that faces many challenges these days. Among them are extremely high development costs, long development times, as well as a low number of new drugs that are approved each year. To solve these problems, new and innovate technologies are needed that make the drug discovery process of smallmolecules more time and cost-efficient, and which allow to target previously undruggable target classes such as protein-protein interactions. Structure-based virtual screenings have become a leading contender in this context. In this review, we give an introduction to the foundations of structure-based virtual screenings, and survey their progress in the past few years. We outline key principles, recent success stories, new methods, available software, and promising future research directions.Virtual screenings have an enormous potential for the development of new small-molecule drugs, and are already starting to transform earlystage drug discovery.

show abstract

“…Since 2015, the number of commercially accessible compounds has grown by over three orders of magnitude, leading to new opportunities to discover new chemistry for new biology. Ultra large scale docking has now been applied to over a dozen targets [1][2][3][4][5][6][7][8][9][10][11] and has discovered new compounds with activities often in the nM and occasionally in the sub-nM range 1,3,6,12 , often leading to molecules with interesting in vivo activities 3,6,12,13 . A challenge to making this approach widely accessible has been the very size of the new libraries, where over 40 billion tangible molecules have been enumerated, and over 4.5 billion of these have structures calculated that are suitable for docking.…”

Section: Introductionmentioning

confidence: 99%

Large Scale Docking in the Cloud

Irwin

Tingle

2023

Preprint

View full text Add to dashboard Cite

Molecular docking is a pragmatic approach to exploit protein structure for new ligand discovery, but the growing size of available chemical space is increasingly challenging to screen on in-house computer clusters. We have therefore developed AWS-DOCK, a protocol for running UCSF DOCK in the AWS cloud. Our approach leverages the low cost and scalability of cloud resources combined with a low-molecule-cost docking engine to screen billions of molecules efficiently. We benchmarked our system by screening 50 million HAC 22 molecules against the DRD4 receptor. We saw up to 3-fold variations in cost between AWS availability zones. Docking 4.5 billion lead-like molecules, a 7-week calculation on our 1000-core lab cluster, runs in less than a week in AWS for around $25,000, less than the cost of two new nodes. The cloud docking protocol is described in easy-to-follow steps and may be sufficiently general to be used for other docking programs. All the tools to enable AWS-DOCK are available free to everyone, while DOCK 3.8 is free for academic research.

show abstract

VirtualFlow Ants—Ultra-Large Virtual Screenings with Artificial Intelligence Driven Docking Algorithm Based on Ant Colony Optimization

Cited by 25 publications

References 58 publications

Efficient Exploration of Chemical Space with Docking and Deep Learning

Efficient Exploration of Chemical Space with Docking and Deep Learning

Recent Developments in Structure-Based Virtual Screening Approaches

Large Scale Docking in the Cloud

Contact Info

Product

Resources

About