Designing a molecule with desired properties is one of the biggest challenges in drug development, as it requires optimization of chemical compound structures with respect to many complex properties. To augment the compound design process we introduce Mol-CycleGAN -a CycleGAN-based model that generates optimized compounds with high structural similarity to the original ones. Namely, given a molecule our model generates a structurally similar one with an optimized value of the considered property. We evaluate the performance of the model on selected optimization objectives related to structural properties (presence of halogen groups, number of aromatic rings) and to a physicochemical property (penalized logP).In the task of optimization of penalized logP of drug-like molecules our model significantly outperforms previous results.
Designing compounds with desired properties is a key
element of
the drug discovery process. However, measuring progress in the field
has been challenging due to the lack of realistic retrospective benchmarks,
and the large cost of prospective validation. To close this gap, we
propose a benchmark based on docking, a widely used computational
method for assessing molecule binding to a protein. Concretely, the
goal is to generate drug-like molecules that are scored highly by
SMINA, a popular docking software. We observe that various graph-based
generative models fail to propose molecules with a high docking score
when trained using a realistically sized training set. This suggests
a limitation of the current incarnation of models for de novo drug design. Finally, we also include simpler tasks in the benchmark
based on a simpler scoring function. We release the benchmark as an
easy to use package available at . We hope that our benchmark will serve as a stepping stone toward
the goal of automatically generating promising drug candidates.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.