We present a computational method for the reaction-based de novo design of drug-like molecules. The software DOGS (Design of Genuine Structures) features a ligand-based strategy for automated ‘in silico’ assembly of potentially novel bioactive compounds. The quality of the designed compounds is assessed by a graph kernel method measuring their similarity to known bioactive reference ligands in terms of structural and pharmacophoric features. We implemented a deterministic compound construction procedure that explicitly considers compound synthesizability, based on a compilation of 25'144 readily available synthetic building blocks and 58 established reaction principles. This enables the software to suggest a synthesis route for each designed compound. Two prospective case studies are presented together with details on the algorithm and its implementation. De novo designed ligand candidates for the human histamine H4 receptor and γ-secretase were synthesized as suggested by the software. The computational approach proved to be suitable for scaffold-hopping from known ligands to novel chemotypes, and for generating bioactive molecules with drug-like properties.
A focused collection of organic synthesis reactions for computer-based molecule construction is presented. It is inspired by real-world chemistry and has been compiled in close collaboration with medicinal chemists to achieve high practical relevance. Virtual molecules assembled from existing starting material connected by these reactions are supposed to have an enhanced chance to be amenable to real chemical synthesis. About 50% of the reactions in the dataset are ring-forming reactions, which fosters the assembly of novel ring systems and innovative chemotypes. A comparison with a recent survey of the reactions used in early drug discovery revealed considerable overlaps with the collection presented here. The dataset is available encoded as computer-readable Reaction SMARTS expressions from the Supporting Information presented for this paper.
Computer-assisted molecular design supports drug discovery by suggesting novel chemotypes and compound modifications for lead structure optimization. While the aspect of synthetic feasibility of the automatically designed compounds has been neglected for a long time, we are currently witnessing an increased interest in this topic. Here, we review state-of-the-art software for de novo drug design with a special emphasis on fragment-based techniques that generate druglike, synthetically accessible compounds. The importance of scoring functions that can be used to predict compound reactivity and potency is highlighted, and several promising solutions are discussed. Recent practical validation studies are presented that have already demonstrated that rule-based fragment assembly can result in novel synthesizable compounds with druglike properties and a desired biological activity.
Computer‐assisted drug design is evolving as a source of innovation for drug discovery. In particular, de novo design approaches are being increasingly applied to find novel drug‐like compounds, molecular scaffolds, and bioisosteric replacements for established or unwanted fragments. Although some of the early software tools had a certain tendency to generate compounds of limited chemical attraction, modern de novo design algorithms put a strong emphasis on the synthesizability and drug‐likeness of machine‐generated compounds. We give an overview of the various methodologies for virtual compound construction, evaluation, and optimization in machina, and how they can support medicinal chemistry projects in the early phase of drug discovery. © 2011 John Wiley & Sons, Ltd. WIREs Comput Mol Sci 2011 1 742–759 DOI: 10.1002/wcms.49 This article is categorized under: Computer and Information Science > Chemoinformatics
In the search for new bioactive compounds, there is a trend toward increasingly complex compound libraries aiming to target the demanding targets of the future. In contrast, medicinal chemistry and traditional library design rely mainly on a small set of highly established and robust reactions. Here, we probe a set of 58 such reactions for their ability to sample the chemical space of known bioactive molecules, and the potential to create new scaffolds. Combined with ~26,000 common available building blocks, the reactions retrieve around 9% of a scaffold-diverse set of compounds active on human target proteins covering all major pharmaceutical target classes. Almost 80% of generated scaffolds from virtual one-step synthesis products are not present in a large set of known bioactive molecules for human targets, indicating potential for new discoveries. The results suggest that established synthesis resources are well suited to cover the known bioactivity-relevant chemical space and that there are plenty of unexplored regions accessible by these reactions, possibly providing valuable "low-hanging fruit" for hit discovery.
We present a fast stochastic optimization algorithm for fragment-based molecular de novo design (COLIBREE, Combinatorial Library Breeding). The search strategy is based on a discrete version of particle swarm optimization. Molecules are represented by a scaffold, which remains constant during optimization, and variable linkers and side chains. Different linkers represent virtual chemical reactions. Side-chain building blocks were obtained from pseudo-retrosynthetic dissection of large compound databases. Here, ligand-based design was performed using chemically advanced template search (CATS) topological pharmacophore similarity to reference ligands as fitness function. A weighting scheme was included for particle swarm optimization-based molecular design, which permits the use of many reference ligands and allows for positive and negative design to be performed simultaneously. In a case study, the approach was applied to the de novo design of potential peroxisome proliferator-activated receptor subtype-selective agonists. The results demonstrate the ability of the technique to cope with large combinatorial chemistry spaces and its applicability to focused library design. The technique was able to perform exploitation of a known scheme and at the same time explorative search for novel ligands within the framework of a given molecular core structure. It thereby represents a practical solution for compound screening in the early hit and lead finding phase of a drug discovery project.
Computer-based de novo design of screening candidates in combination with ligand- and receptor-based virtual screening generates motivated suggestions for focused library design in hit and lead discovery. Attractive, synthetically accessible compounds can be obtained together with predicted on- and off-target profiles and desired activities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.