2021
DOI: 10.26434/chemrxiv.13143167.v3
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Masked Graph Modeling for Molecule Generation

Abstract: De novo, in-silico design of molecules is a challenging problem with applications in drug discovery and material design. We introduce a masked graph model, which learns a distribution over graphs by capturing conditional distributions over unobserved nodes (atoms) and edges (bonds) given observed ones. We train and then sample from our model by iteratively masking and replacing different parts of initialized graphs.<br>We evaluate our approach on the QM9 and ChEMBL datasets using the GuacaMol distributio… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 38 publications
0
3
0
Order By: Relevance
“…These language models can be used to generate molecular libraries for drug discovery 6 or built into variational autoencoders (VAE) 3 , 7 where bayesian optimization can be used to search through the model’s latent space for drug-like molecules. Other models generate molecules as graphs either sequentially 8 – 14 using graph neural networks 15 , 16 or generate whole molecules in one shot 17 – 20 . Two of the most popular: CGAVE and JTVAE can be directly constrained to enforce valency restrictions.…”
Section: Introductionmentioning
confidence: 99%
“…These language models can be used to generate molecular libraries for drug discovery 6 or built into variational autoencoders (VAE) 3 , 7 where bayesian optimization can be used to search through the model’s latent space for drug-like molecules. Other models generate molecules as graphs either sequentially 8 – 14 using graph neural networks 15 , 16 or generate whole molecules in one shot 17 – 20 . Two of the most popular: CGAVE and JTVAE can be directly constrained to enforce valency restrictions.…”
Section: Introductionmentioning
confidence: 99%
“…Code, pretrained MGM models, training and generation scripts for MGM and baseline models, and lists of generated molecules can be found at https://github.com/nyu-dl/ dl4chem-mgm 66 .…”
Section: Data Availabilitymentioning
confidence: 99%
“…Deep learning (DL) generative model offers a promising approach for accessing the chemical space. Molecular generative models can learn the distribution of a set of known chemical compounds and output new molecules in a highly efficient and reliable manner [10][11][12][13][14][15] . These models are essentially ligand-based, as they rely on the prior knowledge of known actives to design similar molecules that potentially bind to the same target with the possibility to fine-tune their physicochemical properties 16,17 .…”
Section: Introductionmentioning
confidence: 99%