2021
DOI: 10.33774/chemrxiv-2021-cg9p8
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MoleGuLAR: Molecule Generation using Reinforcement Learning with Alternating Rewards

Abstract: Design of new inhibitors for novel targets is a very important problem especially in the current scenario with the world being plagued by COVID-19. Conventional approaches undertaken to this end, like, high-throughput virtual screening require extensive combing through existing datasets in the hope of finding possible matches. In this study we propose a computational strategy for de novo generation of molecules with high binding affinities to the specified target. A deep generative model is built using a stack… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 0 publications
0
7
0
Order By: Relevance
“…3 Recently, deep learning has been applied towards more efficient methods of sampling chemical space such that it is possible to identify promising candidate molecules faster. Deep generative models using policy based reinforcement learning (RL) [4][5][6][7][8][9][10] , value based RL 11 , learning a molecular latent space 12 , and other methods including tree search 13 and genetic algorithms [14][15][16] have been proposed to generate molecules that possess a desired set of properties. In the policy based RL paradigm, an agent (a generative model) learns a policy (series of actions to take at given states) to generate molecules that maximize a reward which is typically computed based on a predefined reward function.…”
Section: Introductionmentioning
confidence: 99%
“…3 Recently, deep learning has been applied towards more efficient methods of sampling chemical space such that it is possible to identify promising candidate molecules faster. Deep generative models using policy based reinforcement learning (RL) [4][5][6][7][8][9][10] , value based RL 11 , learning a molecular latent space 12 , and other methods including tree search 13 and genetic algorithms [14][15][16] have been proposed to generate molecules that possess a desired set of properties. In the policy based RL paradigm, an agent (a generative model) learns a policy (series of actions to take at given states) to generate molecules that maximize a reward which is typically computed based on a predefined reward function.…”
Section: Introductionmentioning
confidence: 99%
“…In this section, we demonstrate that simple curricula, utilizing a single Curriculum Objective can accelerate agent productivity and generate compounds that satisfy a docking constraint, i.e., predicted to retain experimentally validated interactions (see Methods for experiment hyperparameters). 6,7,[13][14][15] Simulating a real-world application where one must allocate limited computational resources, baseline RL and CL performances are compared, given a maximum number of permitted production epochs (300), i.e., epochs that involve docking, as these are relatively computationally demanding. For CL, Curriculum Objectives are first applied to guide the agent and the number of permitted curriculum epochs is not limited, as these are computationally inexpensive (see Table S2).…”
Section: Resultsmentioning
confidence: 99%
“…3 Recently, deep learning has been applied towards more efficient methods of sampling chemical space such that it is possible to identify promising candidate molecules faster. Deep generative models using policy-based reinforcement learning (RL) [4][5][6][7][8][9][10] , value based RL 11 , learning a molecular latent space 12 , and other methods including tree search 13 and genetic algorithms [14][15][16] have been proposed to generate molecules that possess a desired set of properties. In the policy-based RL paradigm, an agent (a generative model) learns a policy (series of actions to take at given states) to generate molecules that maximize a reward which is typically computed based on a pre-defined reward function.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…3 Recently, deep learning has been applied towards more efficient methods of sampling chemical space such that it is possible to identify promising candidate molecules faster. Deep generative models using policy based reinforcement learning (RL) [4][5][6][7][8][9][10] , value based RL 11 , learning a molecular latent space 12 , and other methods including tree search 13 and genetic algorithms [14][15][16] have been proposed to generate molecules that possess a desired set of properties. In the policy based RL paradigm, an agent (a generative model) learns a policy (series of actions to take at given states) to generate molecules that maximize a reward which is typically computed based on a predefined reward function.…”
Section: Introductionmentioning
confidence: 99%