Natural product structure and fragment-based compound development inspire pseudo-natural product design through different combinations of a given natural product fragment set to compound classes expected to be chemically and biologically diverse. We describe the synthetic combination of the fragment-sized natural products quinine, quinidine, sinomenine, and griseofulvin with chromanone or indole-containing fragments to provide a 244-member pseudo-natural product collection. Cheminformatic analyses reveal that the resulting eight pseudo-natural product classes are chemically diverse and share both drug- and natural product-like properties. Unbiased biological evaluation by cell painting demonstrates that bioactivity of pseudo-natural products, guiding natural products, and fragments differ and that combination of different fragments dominates establishment of unique bioactivity. Identification of phenotypic fragment dominance enables design of compound classes with correctly predicted bioactivity. The results demonstrate that fusion of natural product fragments in different combinations and arrangements can provide chemically and biologically diverse pseudo-natural product classes for wider exploration of biologically relevant chemical space.
A new methodology for classifying fragment combinations and characterizing pseudonatural products (PNPs) is described. The source code is based on open-source tools and is organized as a Python package. Tasks can be executed individually or within the context of scalable, robust workflows. First, structures are standardized and duplicate entries are filtered out. Then, molecules are probed for the presence of predefined fragments. For molecules with more than one match, fragment combinations are classified. The algorithm considers the pairwise relative position of fragments within the molecule (fused atoms, linkers, intermediary rings), resulting in 18 different possible fragment combination categories. Finally, all combinations for a given molecule are assembled into a fragment combination graph, with fragments as nodes and combination types as edges. This workflow was applied to characterize PNPs in the ChEMBL database via comparison of fragment combination graphs with natural product (NP) references, represented by the Dictionary of Natural Products. The Murcko fragments extracted from 2000 structures previously described were used to define NP fragments. The results indicate that ca. 23% of the biologically relevant compounds listed in ChEMBL comply to the PNP definition and that, therefore, PNPs occur frequently among known biologically relevant small molecules. The majority (>95%) of PNPs contain two to four fragments, mainly (>95%) distributed in five different combination types. These findings may provide guidance for the design of new PNPs.
Over the past decades, virtual screening has proved itself to be a valuable asset to identify new bioactive compounds. The vast majority of commonly used techniques can be described in three steps: pre-processing the dataset i. e. small (ligands) and eventually larger (receptors) molecules, execute the method and finally analyse the results. Hence, the preparation of ligands is a critical step for success of commonly used virtual screening approaches such as protein-ligand docking, similarity or pharmacophore search. We present here a new workflow, VSPrep, for the pre-processing of small molecules; it is based on freely accessible tools for academics and is integrated within the KNIME platform. It can be used to perform several chemoinformatics tasks such as molecular database cleaning, tautomer and stereoisomer enumeration, focused library design and conformer generation. Additionally, graphical reports of the results are provided to the user as a convenient analysis tool.
: Drug discovery is a challenging and expensive field. Hence, novel in silico tools have been developed in early discovery stage to identify and prioritize novel molecules with suitable physicochemical properties. In many in silico drug design projects, molecular databases are screened by virtual screening tools to search for potential bioactive molecules. The preparation of the molecules is therefore a key step in the success of well-established techniques such as docking, similarity or pharmacophore searching. We review here the lists of several toolkits used in different steps during the cleaning of molecular databases, integrated within a KNIME workflow. During the first step of the automatic workflow, salts are removed, and mixtures are split to get one compound per entry. Then compounds with unwanted features are filtered. Duplicated entries are then deleted while considering stereochemistry. As a compromise between exhaustiveness and computational time, most distributed tautomers at physiological pH are computed. Additionally, various flags are applied to molecules by using either classical molecular descriptors, similarity search to known libraries or substructure search rules. Moreover, stereoisomers are enumerated depending on the unassigned chiral centers. Then, three-dimensional coordinates, and optionally conformers, are generated. This workflow has been already applied to several drug design projects and can be used for molecular database preparation upon request.
Natural Products (NP) are a major source of inspiration to develop novel bioactive substances. Various strategies were developed to make use of the relevance of NPs in drug design strategies. It has been shown that NP-derived fragments may still be of biological relevance. Thus, combining various different NP-derived fragments may result in new biologically relevant molecules. These compounds retain some physicochemical properties of NPs but are not accessible through biosynthesis and were therefore termed pseudo-NPs. Since it is not possible to infer the bioactivity of pseudo-NPs from their NP-derived fragments, it is not straightforward to identify their potential targets. Hence more general, morphological phenotypic screens may be the best methods to identify impacted pathways. We highlight the potential of the pseudo-NP approach combined with a recent morphological-, image-based screening technology termed Cell Painting.
A new methodology for classifying fragment combinations and characterizing pseudo-natural products (PNPs) is described. The source code is based on open-source tools and is organized as a Python package. Tasks can be executed individually or within the context of scalable, robust workflows. First, structures are standardized and duplicate entries are filtered out. Then, molecules are probed for the presence of predefined fragments. For molecules with more than one match, fragment combinations are classified. The algorithm considers the pair-wise relative position of fragments within the molecule (fused atoms, linkers, intermediary rings), resulting in 18 different possible fragment combination categories. Finally, all combinations for a given molecule are assembled into a fragment combination graph, with fragments as nodes and combination types as edges. This workflow was applied to characterize PNPs in the ChEMBL database via comparison of fragment combination graphs with Natural Product (NP) references, represented by the Dictionary of Natural Products. The Murcko fragments extracted from 2,000 structures previously described were used to define NP-fragments. The results indicate that ca. 23% of the biologically relevant compounds listed in ChEMBL comply to the PNP definition, and that, therefore, PNPs occur frequently among known biologically relevant small molecules. The majority (>95%) of PNPs contains two to four fragments, mainly (>95%) distributed in five different combination types. These findings may provide guidance for the design of new PNPs.
A new methodology for classifying fragment combinations and characterizing pseudo-natural products (PNPs) is described. The source code is based on open-source tools and is organized as a Python package. Tasks can be executed individually or within the context of scalable, robust workflows. First, structures are standardized and duplicate entries are filtered out. Then, molecules are probed for the presence of predefined fragments. For molecules with more than one match, fragment combinations are classified. The algorithm considers the pair-wise relative position of fragments within the molecule (fused atoms, linkers, intermediary rings), resulting in 18 different possible fragment combination categories. Finally, all combinations for a given molecule are assembled into a fragment combination graph, with fragments as nodes and combination types as edges. This workflow was applied to characterize PNPs in the ChEMBL database via comparison of fragment combination graphs with Natural Product (NP) references, represented by the Dictionary of Natural Products. The Murcko fragments extracted from 2,000 structures previously described were used to define NP-fragments. The results indicate that ca. 23% of the biologically relevant compounds listed in ChEMBL comply to the PNP definition, and that, therefore, PNPs occur frequently among known biologically relevant small molecules. The majority (>95%) of PNPs contains two to four fragments, mainly (>95%) distributed in five different combination types. These findings may provide guidance for the design of new PNPs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.