2021
DOI: 10.1021/acs.jcim.0c01328
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Study of Deep Generative Models on Chemical Space Coverage

Abstract: In recent years, deep molecular generative models have emerged as promising methods for de novo molecular design. Thanks to the rapid advance of deep learning techniques, deep learning architectures such as recurrent neural networks, variational autoencoders, and adversarial networks have been successfully employed for constructing generative models. Recently, quite a few metrics have been proposed to evaluate these deep generative models. However, many of these metrics cannot evaluate the chemical space cover… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
31
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 35 publications
(32 citation statements)
references
References 43 publications
1
31
0
Order By: Relevance
“…Despite improvement in the optimization ability by AHC, it is irrelevant if the resulting de novo structures are invalid or implausible (e.g., incorrect valences, unstable or idiosyncratic functional groups or strained ring systems). The chemistry generated by RNNs has been evaluated previously [ 3 , 23 , 33 , 98 , 99 ] and has usually been considered reasonable with respect to overall topology, fragments, substructures and property space. On the other hand, a comparison of chemistry between AHC and REINVENT is complicated by the scoring function and its suitability for an objective e.g., greater optimization may actually lead to unreasonable chemistry due to scoring function exploitation rather than as a function of the RL strategy.…”
Section: Resultsmentioning
confidence: 99%
“…Despite improvement in the optimization ability by AHC, it is irrelevant if the resulting de novo structures are invalid or implausible (e.g., incorrect valences, unstable or idiosyncratic functional groups or strained ring systems). The chemistry generated by RNNs has been evaluated previously [ 3 , 23 , 33 , 98 , 99 ] and has usually been considered reasonable with respect to overall topology, fragments, substructures and property space. On the other hand, a comparison of chemistry between AHC and REINVENT is complicated by the scoring function and its suitability for an objective e.g., greater optimization may actually lead to unreasonable chemistry due to scoring function exploitation rather than as a function of the RL strategy.…”
Section: Resultsmentioning
confidence: 99%
“…To evaluate the models, we considered the following metrics: Percent validity and uniqueness of sampled structures, Training and validation losses, Chemical space coverage, Fraction molecules reproduced from reference set, Fraction ring systems reproduced from reference set, Fraction functional groups reproduced from reference set, Shape analysis, , NPR1 and NPR2 distributions (aka “PMI plots”). Above, the term reference set refers to the combined training, testing, and validation sets for each COCONUT subset. The validity was evaluated for 1M samples, while uniqueness and coverage were evaluated only for the valid samples.…”
Section: Methodsmentioning
confidence: 99%
“…[105] A range of algorithms have been tried out with varying level of success, among which most popular and versatile have been, variational auto encoders [106] and generative adversarial networks. [107] Both generative schemes can operate on image (pixels and voxels), [108][109][110] graph (nodes and edges) [111][112][113] (e.g., SMILES) and fingerprint-vector based representations [114] Figure 5. Cross-section through the negative electrode, the SEI, and the electrolyte.…”
Section: Deep-learned Models and Explainable Aimentioning
confidence: 99%