2022
DOI: 10.1038/s42256-022-00494-4
|View full text |Cite|
|
Sign up to set email alerts
|

Improving de novo molecular design with curriculum learning

Abstract: Reinforcement learning (RL) is a powerful paradigm that has gained popularity across multiple domains.However, applying RL may come at a cost of multiple interactions between the agent and the environment. This cost can be especially pronounced when the single feedback from the environment is slow or computationally expensive, causing extensive periods of nonproductivity. Curriculum learning (CL) provides a suitable alternative by arranging a sequence of tasks of increasing complexity with the aim of reducing … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
37
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(45 citation statements)
references
References 43 publications
(67 reference statements)
0
37
0
Order By: Relevance
“…We acknowledge that alternative methods can be used to improve the sample-efficiency of RL [ 108 ]. For example, experience replay can be used to remind the agent of ‘good’ molecules [ 58 , 108 ], a margin guard [ 109 ] can be employed to dynamically change α during RL updates or curriculum learning can be used to accelerate learning by breaking the objective into a sequence of simpler tasks [ 110 ]. We are of the opinion that AHC is a more direct and principled approach to improve sample-efficiency and could even be used in combination with these methods to potentially further improve reinforcement learning for de novo molecule optimization.…”
Section: Resultsmentioning
confidence: 99%
“…We acknowledge that alternative methods can be used to improve the sample-efficiency of RL [ 108 ]. For example, experience replay can be used to remind the agent of ‘good’ molecules [ 58 , 108 ], a margin guard [ 109 ] can be employed to dynamically change α during RL updates or curriculum learning can be used to accelerate learning by breaking the objective into a sequence of simpler tasks [ 110 ]. We are of the opinion that AHC is a more direct and principled approach to improve sample-efficiency and could even be used in combination with these methods to potentially further improve reinforcement learning for de novo molecule optimization.…”
Section: Resultsmentioning
confidence: 99%
“…Baselines. We used three well-established models, consisting of RationaleRL, 85 REINVENT, 91 and GB-GA, which exhibit impressive performance on the multi-constraints molecular design task, as the baselines in this study.…”
Section: ■ Methodsmentioning
confidence: 99%
“…Generally, the physical and chemical properties and biological activity are used as constraints of the molecular generation models 9,10 to guide molecular design through reinforcement learning, 11,12 transfer learning, 13 or curriculum learning. 14 To obtain molecules with biological activity, new generated molecules are further evaluated and selected generally through molecular docking software such as Schrodinger, 15 AutoDock Vina, 16 etc. 17 However, such approaches merely perform the docking simulation after the molecular generation model 18 but do not optimize the binding activity to a target in the process of molecular generation.…”
Section: ■ Introductionmentioning
confidence: 99%