De novo molecular design plays an
important role in drug
discovery.
Here, a novel generative model, Tree-Invent, was proposed to integrate
topological constraints in the generation of a molecular graph. In
this model, a molecular graph is represented as a topological tree
in which a ring system, a nonring atom, and a chemical bond are regarded
as the ring node, single node, and edge, respectively. The molecule
generation is driven by three independent submodels for carrying out
operations of node addition, ring generation, and node connection.
One unique feature of the generative model is that the topological
tree structure can be specified as a constraint for structure generation,
which provides more precise control of structure generation. Combined
with reinforcement learning, the Tree-Invent model could efficiently
explore targeted chemical space. Moreover, the Tree-Invent model is
flexible enough to be used in versatile molecule design settings such
as scaffold decoration, scaffold hopping, and linker generation.