2019
DOI: 10.1021/acs.jcim.8b00839
|View full text |Cite
|
Sign up to set email alerts
|

GuacaMol: Benchmarking Models for de Novo Molecular Design

Abstract: De novo design seeks to generate molecules with required property profiles by virtual design-make-test cycles. With the emergence of deep learning and neural generative models in many application areas, models for molecular design based on neural networks appeared recently and show promising results. However, the new models have not been profiled on consistent tasks, and comparative studies to well-established algorithms have only seldom been performed. To standardize the assessment of both classical and neura… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

10
883
0
2

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 632 publications
(967 citation statements)
references
References 95 publications
10
883
0
2
Order By: Relevance
“…There are an increasing number of algorithms of these two categories proposed in recent years and a small number of studies that benchmark these algorithms in terms of their ability to generate novel, optimal molecules. 26,27 We categorize the approaches one might take to ensure that computationally designed molecules are able to be synthesized in Figure 1. These represent combinations of (i) a database of known or enumerated compounds, (ii) an evaluator, which estimates the properties we are trying to optimize, (iii) a generator function, which can propose new candidate molecules, (iv) a synthesizability oracle that determines whether it is straightforward to synthesize a given molecule, and/or (v) a heuristic synthesizability estimator that provides a computationally-inexpensive scalar measure of synthesizability.…”
Section: Introductionmentioning
confidence: 99%
“…There are an increasing number of algorithms of these two categories proposed in recent years and a small number of studies that benchmark these algorithms in terms of their ability to generate novel, optimal molecules. 26,27 We categorize the approaches one might take to ensure that computationally designed molecules are able to be synthesized in Figure 1. These represent combinations of (i) a database of known or enumerated compounds, (ii) an evaluator, which estimates the properties we are trying to optimize, (iii) a generator function, which can propose new candidate molecules, (iv) a synthesizability oracle that determines whether it is straightforward to synthesize a given molecule, and/or (v) a heuristic synthesizability estimator that provides a computationally-inexpensive scalar measure of synthesizability.…”
Section: Introductionmentioning
confidence: 99%
“…We assessed the generated molecules with a range of 2D and 3D metrics. As is standard in the assessment of models for molecule generation, 37 we first checked the generated molecules for validity, uniqueness, and novelty. We then determined if the generated linkers were consistent with the 2D property filters used to produce the training set.…”
Section: Methodsmentioning
confidence: 99%
“…Modern machine learning tools have become impactful, perhaps even essential, in the early stages of the drug design process. Today, deep learning models excel at predicting various chemical properties [1], and deep generative models in conjunction with reinforcement learning are able to efficiently search molecular space for molecules that can optimize several chemical properties simultaneously [2].…”
Section: Introductionmentioning
confidence: 99%