2022
DOI: 10.26434/chemrxiv-2022-prz2r
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation

Abstract: A plethora of AI-based techniques now exists to conduct de novo molecule generation that can devise molecules conditioned towards a particular endpoint in the context of drug design. One popular approach is using reinforcement learning to update a recurrent neural network or language-based de novo molecule generator. However, reinforcement learning can be inefficient, sometimes requiring up to 10^5 molecules to be sampled to optimize more complex objectives, which poses a limitation when using computationally … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 62 publications
0
5
0
Order By: Relevance
“…The metric used is the area under the curve (AUC) for the top 10 molecules. We note that Thomas et al 43 proposed a modified AUC Top-10 metric that incorporates diversity, but we omit comparison as the formulation can be subjective. The current Top AUC-10 metric assesses sample efficiency, which is our focus.…”
Section: Practical Molecular Optimization (Pmo) Benchmarkmentioning
confidence: 99%
See 2 more Smart Citations
“…The metric used is the area under the curve (AUC) for the top 10 molecules. We note that Thomas et al 43 proposed a modified AUC Top-10 metric that incorporates diversity, but we omit comparison as the formulation can be subjective. The current Top AUC-10 metric assesses sample efficiency, which is our focus.…”
Section: Practical Molecular Optimization (Pmo) Benchmarkmentioning
confidence: 99%
“…We expand the PMO benchmark by adding Augmented Memory and BAR implementations. We further add experience replay to the implemented version of AHC , for comparison.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…the number of oracle calls needed to reliably learn the desired output. REINVENT has been identified as one of the most sample-efficient generative chemical models; both in benchmarks which do not consider compound chemistry relative to the pre-training data 60 as well as benchmarks which do, 60 however the model still requires thousands of oracle evaluations to learn to produce favorable molecules. While this may compare favorably with the cost of brute-force VS on large libraries, the incorporation of higher-cost simulations remains prohibitive.…”
Section: Introductionmentioning
confidence: 99%
“…A real-world interpretation of generative models in the drug discovery context remains difficult, and the current work attempts to better understand this by retrospectively applying performance measures to generative models applied to public and private drug discovery data sources. The objective of the task is hence to achieve late-stage project compounds, given information from early-stage compounds, in a limited number of steps, and hence in a sample-efficient way (for a more detailed recent evaluation of the sample efficiency of different methods see a recent study 18 ). This early/late data split strategy is in analogy to 'time-split' validation in the QSAR area, where splitting data into training and test sets along the time domain has been proposed before 19 .…”
Section: Introductionmentioning
confidence: 99%