2021
DOI: 10.1021/acs.jcim.1c00777
|View full text |Cite
|
Sign up to set email alerts
|

Exploring Graph Traversal Algorithms in Graph-Based Molecular Generation

Abstract: Here we explore the impact of different graph traversal algorithms on molecular graph generation. We do this by training a graph-based deep molecular generative model to build structures using a node order determined via either a breadth-or depthfirst search algorithm. What we observe is that using a breadth-first traversal leads to better coverage of training data features compared to a depth-first traversal. We have quantified these differences using a variety of metrics on a dataset of natural products. The… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 23 publications
(47 reference statements)
0
4
0
Order By: Relevance
“…This has been achieved by representing every molecular structure as a graph, where the atoms are the nodes, and the bonds are the edges. Using the depth-first search (DFS) algorithm [ 29 ], the innovative idea is to identify the corresponding atoms of the two structures by following the same path from an atom selected as the root to the leaves, matching the same atoms in the two molecular structures between which the RMSD is calculated. The match between two atoms, one in the reference structure and the other in the predicted pose, is based on: ( i ) the atom type, ( ii ) the covalently bound atoms, and ( iii ) the order of such bonds.…”
Section: Resultsmentioning
confidence: 99%
“…This has been achieved by representing every molecular structure as a graph, where the atoms are the nodes, and the bonds are the edges. Using the depth-first search (DFS) algorithm [ 29 ], the innovative idea is to identify the corresponding atoms of the two structures by following the same path from an atom selected as the root to the leaves, matching the same atoms in the two molecular structures between which the RMSD is calculated. The match between two atoms, one in the reference structure and the other in the predicted pose, is based on: ( i ) the atom type, ( ii ) the covalently bound atoms, and ( iii ) the order of such bonds.…”
Section: Resultsmentioning
confidence: 99%
“…DFS traverses a graph structure from an arbitrary starting node, exploring as far as possible along each branch then backtracking along all paths (Mercado et al 2021). In our hydrogen bond analysis tool, a random residue from the residue list is chosen as a starting node, then all the routes from this node that connect to QB are recorded.…”
Section: Openmm MD Simulationmentioning
confidence: 99%
“…Thus, by supplying a start symbol, new symbols can be sampled from the probability distribution corresponding to the next symbol (output by the RNN), which is then recursively fed back into the network resulting in de novo molecules. Despite a wave of newer approaches since (e.g., JT-VAE [ 13 ], DrugEx [ 29 ], GENTRL [ 30 ], GraphINVENT [ 20 , 31 ]), RNNs are still frequently used and investigated for de novo molecule generation (e.g., [ 32 – 34 ]). Furthermore, they still match the state-of-the-art on several de novo molecule generation benchmarks [ 22 , 23 , 35 , 36 ].…”
Section: Introductionmentioning
confidence: 99%