Within the last few years, a considerable number of evaluative studies have been published that investigate the performance of 3D virtual screening approaches. Assessments of protein-ligand docking in particular are attracting remarkable interest in the scientific community. However, comparing virtual screening approaches is a non-trivial task. Several publications, especially in the field of molecular docking, suffer from shortcomings that are likely to affect the significance of the results considerably. These quality issues often arise from poor study design, bias, the use of improper or inexpressive enrichment descriptors, and errors in the interpretation of the data output. In this review we analyze recent literature evaluating 3D virtual screening methods, with a focus on molecular docking. We highlight problematic issues and provide guidelines on how to improve the quality of computational studies. Since 3D virtual screening protocols are in general assessed by their ability to discriminate between active and inactive compounds, we summarize the impact of the composition and preparation of test sets on the outcome of evaluations. Moreover, we investigate the significance of both classic enrichment parameters and advanced descriptors for the performance of 3D virtual screening methods. Furthermore, we review the significance and suitability of RMSD as a measure for the accuracy of protein-ligand docking algorithms and of conformational space subsampling algorithms.
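To make the notion of enrichment descriptors concrete, the following minimal sketch computes two commonly used measures from a ranked screening list: the enrichment factor at a chosen fraction of the list and the area under the ROC curve. The function names and the toy ranking are illustrative assumptions and are not taken from the review; ties in the scores are assumed to be absent.

```python
def enrichment_factor(labels_ranked, fraction):
    """Enrichment factor at the top `fraction` of a ranked list.

    labels_ranked: 1 for active, 0 for inactive, best-scored compound first.
    (Hypothetical helper for illustration.)
    """
    n_total = len(labels_ranked)
    n_actives = sum(labels_ranked)
    if n_total == 0 or n_actives == 0:
        raise ValueError("need a non-empty list with at least one active")
    n_top = max(1, round(n_total * fraction))
    hits_top = sum(labels_ranked[:n_top])
    # Ratio of the hit rate in the top slice to the hit rate in the whole set.
    return (hits_top / n_top) / (n_actives / n_total)


def roc_auc(labels_ranked):
    """ROC AUC as the fraction of (active, inactive) pairs ranked correctly.

    Assumes no tied scores, so the ranked order alone is sufficient.
    """
    n_actives = sum(labels_ranked)
    n_inactives = len(labels_ranked) - n_actives
    if n_actives == 0 or n_inactives == 0:
        raise ValueError("need at least one active and one inactive")
    inactives_seen = 0
    misordered_pairs = 0  # pairs where an inactive outranks an active
    for label in labels_ranked:
        if label == 1:
            misordered_pairs += inactives_seen
        else:
            inactives_seen += 1
    return 1.0 - misordered_pairs / (n_actives * n_inactives)


# Toy ranking: 3 actives among 10 compounds, best score first.
ranking = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
ef_20 = enrichment_factor(ranking, 0.2)
auc = roc_auc(ranking)
```

The enrichment factor depends on the ratio of actives to inactives in the test set, whereas the AUC summarizes the whole ranking; this difference is one reason why the choice of descriptor influences how a method appears to perform.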
Drug metabolism can produce metabolites with physicochemical and pharmacological properties that differ substantially from those of the parent drug, and consequently has important implications for both drug safety and efficacy. To reduce the risk of costly clinical-stage attrition due to the metabolic characteristics of drug candidates, there is a need for efficient and reliable ways to predict drug metabolism in vitro, in silico and in vivo. In this Perspective, we provide an overview of the state of the art of experimental and computational approaches for investigating drug metabolism. We highlight the scope and limitations of these methods, and indicate strategies to harvest the synergies that result from combining measurement and prediction of drug metabolism.
In continuation of our studies to evaluate the ability of various conformer generators to produce bioactive conformations, we extend our analysis of Catalyst's conformational subsampling algorithm with a comparative evaluation against OpenEye's recently updated tool Omega 2.0. Our study is based on an enhanced test set of 778 drug molecules and pharmacologically relevant compounds extracted from the Protein Data Bank (PDB). We elaborated protocols for two common conformer generation use cases and applied them to both programs: (i) high-throughput settings for processing large databases and (ii) high-quality settings for binding site exploration or lead structure refinement. While Catalyst is faster in the first case, Omega 2.0 better reproduces the bound ligand conformations from the PDB in less time for the latter case.
Shape-based molecular similarity approaches have been established as important and popular virtual screening techniques. Recent applications have shown successful screening campaigns using different parameters and query selection. It is generally accepted that pure volume overlap scoring (or "shape-based screening") under-represents chemical or pharmacophoric information of a molecule. Using the "Directory of Useful Decoys" (DUD) as a benchmark set, we systematically evaluate how (i) the choice of query conformations, (ii) the selection of the active compound to be used as a query structure, and (iii) the inclusion of chemical information (i.e., the pharmacophoric properties of the query molecule) affect screening performance. Varying these parameters bears remarkable potential for improvements and delivers the best screening performance reported using these tools so far. From these insights, guidelines on how to reach optimum performance during virtual screening are developed.
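As an illustration of what pure volume overlap scoring means, the sketch below computes a Tanimoto coefficient between two molecular shapes represented as sets of occupied grid cells. This is an assumption made for illustration only: the abstract does not specify the scoring function or shape representation used, and the function name and voxel coordinates are hypothetical.

```python
def shape_tanimoto(voxels_a, voxels_b):
    """Tanimoto coefficient of two shapes given as occupied grid cells.

    Each shape is an iterable of integer (x, y, z) cell indices. The score is
    overlap volume divided by the volume of the union, so it ranges from 0
    (no overlap) to 1 (identical shapes). Atom types are ignored entirely.
    """
    cells_a, cells_b = set(voxels_a), set(voxels_b)
    union = len(cells_a | cells_b)
    if union == 0:
        raise ValueError("both shapes are empty")
    return len(cells_a & cells_b) / union


# Two toy rod-like shapes offset by one cell along x.
query = [(0, 0, 0), (1, 0, 0), (2, 0, 0)]
candidate = [(1, 0, 0), (2, 0, 0), (3, 0, 0)]
score = shape_tanimoto(query, candidate)
```

Because the score uses only occupied volume, two molecules with the same shape but different pharmacophoric features receive the same value, which is the limitation the abstract refers to.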
Metabolism of xenobiotics remains a central challenge for the discovery and development of drugs, cosmetics, nutritional supplements, and agrochemicals. Metabolic transformations are frequently related to the incidence of toxic effects that may result from the emergence of reactive species, the systemic accumulation of metabolites, or the induction of metabolic pathways. Experimental investigation of the metabolism of small organic molecules is particularly resource demanding; hence, computational methods are of considerable interest to complement experimental approaches. This review provides a broad overview of structure- and ligand-based computational methods for the prediction of xenobiotic metabolism. Current computational approaches to address xenobiotic metabolism are discussed from three major perspectives: (i) prediction of sites of metabolism (SOMs), (ii) elucidation of potential metabolites and their chemical structures, and (iii) prediction of direct and indirect effects of xenobiotics on metabolizing enzymes, where the focus is on the cytochrome P450 (CYP) superfamily of enzymes, the cardinal xenobiotic-metabolizing enzymes. For each of these domains, a variety of approaches and their applications are systematically reviewed, including expert systems, data mining approaches, quantitative structure–activity relationships (QSARs), machine learning-based methods, pharmacophore-based algorithms, shape-focused techniques, molecular interaction fields (MIFs), reactivity-focused techniques, protein–ligand docking, molecular dynamics (MD) simulations, and combinations of methods. Predictive metabolism is a developing area, and there is still enormous potential for improvement. However, it is clear that the combination of rapidly increasing amounts of available ligand- and structure-related experimental data (in particular, quantitative data) with novel and diverse simulation and modeling approaches is accelerating the development of effective tools for prediction of in vivo metabolism, which is reflected by the diverse and comprehensive data sources and methods for metabolism prediction reviewed here. This review attempts to survey the range and scope of computational methods applied to metabolism prediction and also to compare and contrast their applicability and performance.
We examined the quality of Catalyst's conformational model generation algorithm in a large-scale study based on the crystal structures of a sample of 510 pharmaceutically relevant protein-ligand complexes extracted from the Protein Data Bank (PDB). Our results show that the tested algorithms implemented within Catalyst are able to produce high-quality conformers, which in most cases are well suited for in silico drug research. Catalyst-specific settings were analyzed, such as the method used for the conformational model generation (FAST vs BEST) and the maximum number of generated conformers. By setting these options for higher fitting quality, the average RMS value describing the similarity of experimental and simulated conformers improved from 1.06 with a maximum of 50 FAST-generated conformers to 0.93 with a maximum of 255 BEST-generated conformers, an improvement of 12%. Each method provides best-fitting conformers with an RMS value below 1.50 in more than 80% of all cases. We analyzed the computing time/quality ratio of various conformational model generation settings and examined ligands in high-energy conformations. Furthermore, properties of the same ligands in various proteins were investigated, and the fitting qualities of experimental conformations from the PDB and the Cambridge Structural Database (CSD) were compared. One of the most important conclusions of former studies, namely that bioactive conformers often have energies well above that of the global minimum, was confirmed.
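The RMS values quoted above measure how closely a generated conformer matches the experimental one. The following minimal sketch shows the basic calculation and checks the reported 12% improvement. It assumes that both conformers list the same atoms in the same order and have already been superimposed; the abstract does not describe how Catalyst performs the fit, and the function names and toy coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-aligned coordinate sets.

    coords_a, coords_b: sequences of (x, y, z) tuples with matching atom
    order. No superposition is performed here (assumption: already fitted).
    """
    if len(coords_a) != len(coords_b) or len(coords_a) == 0:
        raise ValueError("coordinate sets must be non-empty and equal in size")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


def relative_improvement(before, after):
    """Fractional reduction of an error measure, e.g. 0.12 for 12%."""
    return (before - after) / before


# Toy example: every atom is displaced by exactly one unit along z.
experimental = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
generated = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
toy_rmsd = rmsd(experimental, generated)

# Average RMS values reported in the abstract: 1.06 (FAST, 50) vs 0.93 (BEST, 255).
improvement_percent = round(100 * relative_improvement(1.06, 0.93))
```

The second calculation reproduces the figure in the text: (1.06 - 0.93) / 1.06 is roughly 0.123, which rounds to the stated 12%.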
Natural products from plants, animals, marine life, fungi, bacteria, and other organisms are an important resource for modern drug discovery. Their biological relevance and structural diversity make natural products good starting points for drug design. Natural product-based drug discovery can benefit greatly from computational approaches, which are a valuable precursor or supplementary method to in vitro testing. We present an overview of 25 virtual and 31 physical natural product libraries that are useful for applications in cheminformatics, in particular virtual screening. The overview includes detailed information about each library, the extent of its structural information, and the overlap between different sources of natural products. In terms of chemical structures, there is a large overlap between freely available and commercial virtual natural product libraries. Of particular interest for drug discovery is that at least ten percent of known natural products are readily purchasable, and many more natural products and derivatives are available through on-demand sourcing, extraction, and synthesis services. Many of the readily purchasable natural products are of small size and hence of relevance to fragment-based drug discovery. There is also an increasing number of macrocyclic natural products and derivatives becoming available for screening.