MoleGuLAR: Molecule Generation using Reinforcement Learning with Alternating Rewards

Goel, Manan

doi:10.33774/chemrxiv-2021-cg9p8

Cited by 5 publications

(7 citation statements)

References 0 publications


“…[3] Recently, deep learning has been applied towards more efficient methods of sampling chemical space such that it is possible to identify promising candidate molecules faster. Deep generative models using policy based reinforcement learning (RL) [4][5][6][7][8][9][10], value based RL [11], learning a molecular latent space [12], and other methods including tree search [13] and genetic algorithms [14][15][16] have been proposed to generate molecules that possess a desired set of properties. In the policy based RL paradigm, an agent (a generative model) learns a policy (series of actions to take at given states) to generate molecules that maximize a reward which is typically computed based on a predefined reward function.…”

Section: Introduction (mentioning)

confidence: 99%

Improving de novo molecular design with curriculum learning

et al. 2022


Reinforcement learning (RL) is a powerful paradigm that has gained popularity across multiple domains. However, applying RL may come at a cost of multiple interactions between the agent and the environment. This cost can be especially pronounced when the single feedback from the environment is slow or computationally expensive, causing extensive periods of nonproductivity. Curriculum learning (CL) provides a suitable alternative by arranging a sequence of tasks of increasing complexity with the aim of reducing the overall cost of learning. Here, we demonstrate the application of CL for drug discovery. We implement CL in the de novo design platform, REINVENT, and apply it on illustrative de novo molecular design problems of different complexity. The results show both accelerated learning and a positive impact on the quality of the output when compared to standard policy based RL. To our knowledge, this is the first application of CL for the purposes of de novo molecular design. The code is freely available at https://github.com/MolecularAI/Reinvent.


“…In this section, we demonstrate that simple curricula, utilizing a single Curriculum Objective can accelerate agent productivity and generate compounds that satisfy a docking constraint, i.e., predicted to retain experimentally validated interactions (see Methods for experiment hyperparameters). [6][7][13][14][15] Simulating a real-world application where one must allocate limited computational resources, baseline RL and CL performances are compared, given a maximum number of permitted production epochs (300), i.e., epochs that involve docking, as these are relatively computationally demanding. For CL, Curriculum Objectives are first applied to guide the agent and the number of permitted curriculum epochs is not limited, as these are computationally inexpensive (see Table S2).…”

Section: Results (mentioning)

confidence: 99%

“…[3] Recently, deep learning has been applied towards more efficient methods of sampling chemical space such that it is possible to identify promising candidate molecules faster. Deep generative models using policy-based reinforcement learning (RL) [4][5][6][7][8][9][10], value based RL [11], learning a molecular latent space [12], and other methods including tree search [13] and genetic algorithms [14][15][16] have been proposed to generate molecules that possess a desired set of properties. In the policy-based RL paradigm, an agent (a generative model) learns a policy (series of actions to take at given states) to generate molecules that maximize a reward which is typically computed based on a pre-defined reward function.…”

Section: Introduction (mentioning)

confidence: 99%

“…In the policy-based RL paradigm, an agent (a generative model) learns a policy (series of actions to take at given states) to generate molecules that maximize a reward which is typically computed based on a pre-defined reward function. [4][5][6][7][8][9][10] Often, physics-based approximations of binding affinity such as molecular docking are included as a component in the reward function in order to design candidate molecules with enhanced predicted potency. Given sufficiently long training time, these models can learn to generate molecules which satisfy the desired MPO objective.…”

Section: Introduction (mentioning)

confidence: 99%


Improving De Novo Molecular Design with Curriculum Learning

Guo, Fialková, Arango, et al. 2022 (preprint)


Improving De Novo Molecular Design with Curriculum Learning

Guo, Fialková, Arango, et al. 2021 (preprint)
