2018
DOI: 10.1246/cl.180665
|View full text |Cite
|
Sign up to set email alerts
|

Population-based De Novo Molecule Generation, Using Grammatical Evolution

Abstract: Automatic design with machine learning and molecular simulations has shown a remarkable ability to generate new and promising drug candidates. Current models, however, still have problems in simulation concurrency and molecular diversity. Most methods generate one molecule at a time and do not allow multiple simulators to run simultaneously. Additionally, better molecular diversity could boost the success rate in the subsequent drug discovery process. We propose a new population-based approach using grammatica… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
89
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 100 publications
(94 citation statements)
references
References 22 publications
0
89
0
Order By: Relevance
“…13,25 Yoshikawa et al proposed a method that evolves string molecular representations using mutations exploiting the SMILES context-free grammar. 87 For each goal-directed benchmark the 300 highest scoring molecules in the dataset are selected as the initial population. Each molecule is represented by 300 genes.…”
Section: Smiles Gamentioning
confidence: 99%
“…13,25 Yoshikawa et al proposed a method that evolves string molecular representations using mutations exploiting the SMILES context-free grammar. 87 For each goal-directed benchmark the 300 highest scoring molecules in the dataset are selected as the initial population. Each molecule is represented by 300 genes.…”
Section: Smiles Gamentioning
confidence: 99%
“…158 Since then, the development of better reward functions has greatly helped to mitigate such issues, but low diversity and novelty remains an issue. [159][160][161] After reviewing the work that has been done so far on reward function design, we conclude that good reward functions should lead to generated molecules which meet the following desiderata:…”
Section: Reward Function Designmentioning
confidence: 99%
“…The literature concerning generative models of molecules has exploded since the first work on the topic Gómez-Bombarelli et al [2018]. Current methods feature molecular representations such as SMILES [Janz et al, 2018, Segler et al, 2017, Skalic et al, 2019, Ertl et al, 2017, Lim et al, 2018, Kang and Cho, 2018, Sattarov et al, 2019, Gupta et al, 2018, Harel and Radinsky, 2018, Yoshikawa et al, 2018, Bjerrum and Sattarov, 2018, Mohammadi et al, 2019 and graphs [Simonovsky and Komodakis, 2018, Li et al, 2018a, De Cao and Kipf, 2018, Kusner et al, 2017, Dai et al, 2018, Samanta et al, 2019, Li et al, 2018b, Kajino, 2019, Jin et al, 2019, Bresson and Laurent, 2019, Lim et al, 2019, Pölsterl and Wachinger, 2019, Krenn et al, 2019, Maziarka et al, 2019, Madhawa et al, 2019, Shen, 2018, Korovina et al, 2019 In this section we conduct an empirical test of the hypothesis from [Gómez-Bombarelli et al, 2018] that the decoder's lack of efficiency is due to data point collection in "dead regions" of the latent space far from the data on which the VAE was trained. We use this information to construct a binary classification Bayesian Neural Network (BNN) to serve as a constraint function that outputs the probability of a latent point being valid, the details of which will be discussed in the section on labelling criteria.…”
Section: Related Workmentioning
confidence: 99%