Virtual Screening System for Finding Structurally Diverse Hits by Active Learning

Fujiwara, Yoshitaka; Yamashita, Yugo; Osoda, Tsutomu; Asogawa, Minoru; Fukushima, Chiaki; Asao, Masaaki; Shimadzu, Hideshi; Nakao, Kazuya; Shimizu, Ryo

doi:10.1021/ci700085q

Cited by 37 publications

(46 citation statements)

References 18 publications

“…The focus was on activity alone and did not assess questions of structural diversity. Fujiwara et al. [18] studied active learning in the context of virtual screening and considered the question of structural diversity. As with the Warmuth study, compound activity was considered as a binary variable and temporal considerations were not taken into account.…”

Section: Results (mentioning)

confidence: 99%

Iterative Refinement of a Binding Pocket Model: Active Computational Steering of Lead Optimization

et al. 2012

Computational approaches for binding affinity prediction are most frequently demonstrated through cross-validation within a series of molecules or through performance shown on a blinded test set. Here, we show how such a system performs in an iterative, temporal lead optimization exercise. A series of gyrase inhibitors with known synthetic order formed the set of molecules that could be selected for “synthesis.” Beginning with a small number of molecules, based only on structures and activities, a model was constructed. Compound selection was done computationally, each time making five selections based on confident predictions of high activity and five selections based on a quantitative measure of three-dimensional structural novelty. Compound selection was followed by model refinement using the new data. Iterative computational candidate selection produced rapid improvements in selected compound activity, and incorporation of explicitly novel compounds uncovered much more diverse active inhibitors than strategies lacking active novelty selection.
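
The selection step described in this abstract (five picks by predicted activity, five by structural novelty, per round) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `select_batch` and the two score dictionaries are hypothetical, and the scores stand in for whatever activity model and three-dimensional novelty measure the paper actually used.

```python
from typing import Dict, List


def select_batch(
    predicted: Dict[str, float],
    novelty: Dict[str, float],
    n_exploit: int = 5,
    n_explore: int = 5,
) -> List[str]:
    """Pick n_exploit candidates by predicted activity, then n_explore by novelty.

    Both dictionaries map a candidate identifier to a score (higher is better).
    They are placeholders for a real activity model and novelty measure.
    """
    # Exploit: highest predicted activity first; ties broken by name so the
    # result is deterministic.
    by_activity = sorted(predicted, key=lambda c: (-predicted[c], c))
    exploit = by_activity[:n_exploit]

    # Explore: highest novelty among candidates not already chosen above.
    chosen = set(exploit)
    remaining = [c for c in novelty if c not in chosen]
    by_novelty = sorted(remaining, key=lambda c: (-novelty[c], c))
    explore = by_novelty[:n_explore]

    return exploit + explore


if __name__ == "__main__":
    # Toy scores, invented purely for illustration.
    activity = {"cpd_a": 0.9, "cpd_b": 0.8, "cpd_c": 0.1, "cpd_d": 0.2}
    novel = {"cpd_a": 0.1, "cpd_b": 0.9, "cpd_c": 0.7, "cpd_d": 0.3}
    print(select_batch(activity, novel, n_exploit=1, n_explore=2))
```

In an iterative setting, the selected compounds would be "synthesized" (their measured activities revealed), the model refit, and both score dictionaries recomputed before the next call.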

“…Thereby, the best use is made of the bioactivity data, while limiting the overall number of assays performed. [7] New compounds are either selected with a focus on maximal information content and diversity of the molecular reference structures (explorative strategy) [8, 9] or with a focus on improved bioactivity (exploitive/greedy strategy). [10–12] Until now, this concept has essentially been studied only theoretically.…”

Section: Introduction (mentioning)

confidence: 99%

“…Furthermore, we propose a technique for the informed batch-wise selection of compounds, which is of particular practical relevance for the application of active learning in the context of biological studies where many assays are effectively performed in batches. [9, 13]…”

Section: Introduction (mentioning)

confidence: 99%

Multi-objective active machine learning rapidly improves structure–activity models and reveals new protein–protein interaction inhibitors

Reker, Schneider. 2016. Chem. Sci.

“…There have been limited previous applications of active learning to the drug discovery process. In these efforts, compound activity was considered to be binary (active or inactive) and effort was focused on only a single target [22,23]. …”

Section: Introduction (mentioning)

confidence: 99%

Efficient discovery of responses of proteins to compounds using active learning

2014

Background: Drug discovery and development has been aided by high throughput screening methods that detect compound effects on a single target. However, when using focused initial screening, undesirable secondary effects are often detected late in the development process after significant investment has been made. An alternative approach would be to screen against undesired effects early in the process, but the number of possible secondary targets makes this prohibitively expensive.

Results: This paper describes methods for making this global approach practical by constructing predictive models for many target responses to many compounds and using them to guide experimentation. We demonstrate for the first time that by jointly modeling targets and compounds using descriptive features and using active machine learning methods, accurate models can be built by doing only a small fraction of possible experiments. The methods were evaluated by computational experiments using a dataset of 177 assays and 20,000 compounds constructed from the PubChem database.

Conclusions: An average of nearly 60% of all hits in the dataset were found after exploring only 3% of the experimental space which suggests that active learning can be used to enable more complete characterization of compound effects than otherwise affordable. The methods described are also likely to find widespread application outside drug discovery, such as for characterizing the effects of a large number of compounds or inhibitory RNAs on a large number of cell or tissue phenotypes.
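
The headline figures in this abstract (share of hits found versus share of the experimental space explored) correspond to two simple ratios. The sketch below shows one way to compute them. It assumes, as an interpretation not stated in the abstract, that the experimental space is the full set of assay-compound pairs, which for 177 assays and 20,000 compounds would be 3,540,000 experiments, so 3% is about 106,200 of them. The function name and the pair representation are hypothetical.

```python
from typing import Hashable, Iterable, Tuple


def hit_recall_and_coverage(
    explored: Iterable[Hashable],
    hits: Iterable[Hashable],
    space_size: int,
) -> Tuple[float, float]:
    """Return (fraction of hits found, fraction of the space explored).

    `explored` and `hits` hold experiment identifiers, for example
    (assay_id, compound_id) pairs. `space_size` is the total number of
    possible experiments.
    """
    explored_set = set(explored)
    hit_set = set(hits)
    if not hit_set:
        raise ValueError("hits must not be empty")
    if space_size <= 0:
        raise ValueError("space_size must be positive")

    recall = len(explored_set & hit_set) / len(hit_set)
    coverage = len(explored_set) / space_size
    return recall, coverage


if __name__ == "__main__":
    # Size of the full assay-by-compound grid under the assumption above.
    full_space = 177 * 20_000
    # Toy data, invented for illustration only.
    done = [(0, 1), (0, 2), (1, 1)]
    true_hits = [(0, 1), (5, 5)]
    print(hit_recall_and_coverage(done, true_hits, full_space))
```

Tracking both numbers after each round of an active learning run gives the kind of recall-versus-effort curve that the abstract summarizes with a single point.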
