Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation

Thomas, Morgan; O’Boyle, Noel M.; Bender, Andreas; Graaf, Chris de

doi:10.26434/chemrxiv-2022-prz2r

Cited by 4 publications

(5 citation statements)

References 62 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The metric used is the area under the curve (AUC) for the top 10 molecules. We note that Thomas et al 43 proposed a modified AUC Top-10 metric that incorporates diversity, but we omit comparison as the formulation can be subjective. The current Top AUC-10 metric assesses sample efficiency, which is our focus.…”

Section: Practical Molecular Optimization (Pmo) Benchmarkmentioning

confidence: 99%

“…We expand the PMO benchmark by adding Augmented Memory and BAR implementations. We further add experience replay to the implemented version of AHC , for comparison.…”

Section: Introductionmentioning

confidence: 99%

“…This is demonstrated in an exploitation task, the PMO benchmark, and in both drug and materials design case studies. We propose a method called Selective Memory Purge, which can be used in conjunction with Augmented Memory to generate diverse molecules while retaining enhanced sample efficiency. We explicitly highlight the importance of experience replay on the sample efficiency of REINVENT and all proposed algorithmic modifications. We expand the PMO benchmark by adding Augmented Memory and BAR implementations. We further add experience replay to the implemented version of AHC , for comparison. …”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Augmented Memory: Sample-Efficient Generative Molecular Design with Reinforcement Learning

Guo,

Schwaller

2024

JACS Au

View full text Add to dashboard Cite

Sample efficiency is a fundamental challenge in de novo molecular design. Ideally, molecular generative models should learn to satisfy a desired objective under minimal calls to oracles (computational property predictors). This problem becomes more apparent when using oracles that can provide increased predictive accuracy but impose significant computational cost. Consequently, designing molecules that are optimized for such oracles cannot be achieved under a practical computational budget. Molecular generative models based on simplified molecular-input line-entry system (SMILES) have shown remarkable sample efficiency when coupled with reinforcement learning, as demonstrated in the practical molecular optimization (PMO) benchmark. Here, we first show that experience replay drastically improves the performance of multiple previously proposed algorithms. Next, we propose a novel algorithm called Augmented Memory that combines data augmentation with experience replay. We show that scores obtained from oracle calls can be reused to update the model multiple times. We compare Augmented Memory to previously proposed algorithms and show significantly enhanced sample efficiency in an exploitation task, a drug discovery case study requiring both exploration and exploitation, and a materials design case study optimizing explicitly for quantummechanical properties. Our method achieves a new state-of-the-art in sample-efficient de novo molecular design, outperforming all of the previously reported methods. The code is available at https://github.com/schwallergroup/augmented_memory.

show abstract

Section: Practical Molecular Optimization (Pmo) Benchmarkmentioning

confidence: 99%

“…We expand the PMO benchmark by adding Augmented Memory and BAR implementations. We further add experience replay to the implemented version of AHC , for comparison.…”

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Augmented Memory: Sample-Efficient Generative Molecular Design with Reinforcement Learning

Guo,

Schwaller

2024

JACS Au

View full text Add to dashboard Cite

show abstract

“…the number of oracle calls needed to reliably learn the desired output. REINVENT has been identified as one of the most sample-efficient generative chemical models; both in benchmarks which do not consider compound chemistry relative to the pre-training data 60 as well as benchmarks which do, 60 however the model still requires thousands of oracle evaluations to learn to produce favorable molecules. While this may compare favorably with the cost of brute-force VS on large libraries, the incorporation of higher-cost simulations remains prohibitive.…”

Section: Introductionmentioning

confidence: 99%

Sample efficient reinforcement learning with active learning for molecular design

Dodds,

Guo,

Löhr

et al. 2024

Chem. Sci.

View full text Add to dashboard Cite

show abstract

“…A real-world interpretation of generative models in the drug discovery context remains difficult, and the current work attempts to better understand this by retrospectively applying performance measures to generative models applied to public and private drug discovery data sources. The objective of the task is hence to achieve late-stage project compounds, given information from early-stage compounds, in a limited number of steps, and hence in a sample-efficient way (for a more detailed recent evaluation of the sample efficiency of different methods see a recent study 18 ). This early/late data split strategy is in analogy to 'time-split' validation in the QSAR area, where splitting data into training and test sets along the time domain has been proposed before 19 .…”

Section: Introductionmentioning

confidence: 99%

On The Difficulty of Validating Molecular Generative Models Realistically: A Case Study on Public and Proprietary Data

Handa

Thomas

Kageyama

et al. 2023

Preprint

View full text Add to dashboard Cite

While a multitude of deep generative models have recently emerged there exists no best practice for their practically relevant validation. On the one hand, novel de novo-generated molecules cannot be refuted by retrospective validation (so that this type of validation is biased); but on the other hand prospective validation is expensive and then often biased by the human selection process. In this case study, we frame retrospective validation as the ability to mimic human drug design, by answering the following question: Can a generative model trained on early-stage project compounds generate middle/late-stage compounds de novo? To this end, we used experimental data that contains the elapsed time of a synthetic expansion following hit identification from five public (where the time series was pre-processed to better reflect realistic synthetic expansions) and six in-house project datasets, and used REINVENT as a widely adopted RNN-based generative model. After splitting the dataset and training REINVENT on early-stage compounds, we found that rediscovery of middle/late-stage compounds was much higher in public projects (at 1.60%, 0.64%, and 0.21% of the top 100, 500, and 5,000 scored generated compounds) than in in-house projects (where the values were 0.00%, 0.03%, and 0.04%, respectively). Similarly, average single nearest neighbour similarity between early- and middle/late-stage compounds in public projects was higher between active compounds than inactive compounds; however, for in-house projects the converse was true, which makes rediscovery (if so desired) more difficult. We hence show that the generative model recovers very few middle/late-stage compounds from real-world drug discovery projects, highlighting the fundamental difference between purely algorithmic design and drug discovery as a real-world process. Evaluating de novo compound design approaches appears, based on the current study, difficult or even impossible to do retrospectively.

show abstract

Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation

Cited by 4 publications

References 62 publications

Augmented Memory: Sample-Efficient Generative Molecular Design with Reinforcement Learning

Augmented Memory: Sample-Efficient Generative Molecular Design with Reinforcement Learning

Sample efficient reinforcement learning with active learning for molecular design

On The Difficulty of Validating Molecular Generative Models Realistically: A Case Study on Public and Proprietary Data

Contact Info

Product

Resources

About