BackgroundIn the present work, we aim to transfer to the field of virtual screening the predictiveness curve, a metric that has been advocated in clinical epidemiology. The literature describes the use of predictiveness curves to evaluate the performances of biological markers to formulate diagnoses, prognoses and assess disease risks, assess the fit of risk models, and estimate the clinical utility of a model when applied to a population. Similarly, we use logistic regression models to calculate activity probabilities related to the scores that the compounds obtained in virtual screening experiments. The predictiveness curve can provide an intuitive and graphical tool to compare the predictive power of virtual screening methods.ResultsSimilarly to ROC curves, predictiveness curves are functions of the distribution of the scores and provide a common scale for the evaluation of virtual screening methods. Contrarily to ROC curves, the dispersion of the scores is well described by predictiveness curves. This property allows the quantification of the predictive performance of virtual screening methods on a fraction of a given molecular dataset and makes the predictiveness curve an efficient tool to address the early recognition problem. To this last end, we introduce the use of the total gain and partial total gain to quantify recognition and early recognition of active compounds attributed to the variations of the scores obtained with virtual screening methods. Additionally to its usefulness in the evaluation of virtual screening methods, predictiveness curves can be used to define optimal score thresholds for the selection of compounds to be tested experimentally in a drug discovery program. We illustrate the use of predictiveness curves as a complement to ROC on the results of a virtual screening of the Directory of Useful Decoys datasets using three different methods (Surflex-dock, ICM, Autodock Vina).ConclusionThe predictiveness curves cover different aspects of the predictive power of the scores, allowing a detailed evaluation of the performance of virtual screening methods. We believe predictiveness curves efficiently complete the set of tools available for the analysis of virtual screening results.Electronic supplementary materialThe online version of this article (doi:10.1186/s13321-015-0100-8) contains supplementary material, which is available to authorized users.
The statistical and biological relevance of these associations and their high ORs underscore the power of extreme phenotypes for GWASs, even with a modest sample size. These genetic results emphasize the role of the transforming growth factor beta pathway in the pathogenesis of HIV-1 disease. Finally, the wealth of information provided by this study should help unravel new diagnostic and therapeutic targets.
Nuclear receptors (NRs) constitute an important class of drug targets. We created the most exhaustive NR-focused benchmarking database to date, the NRLiSt BDB (NRs ligands and structures benchmarking database). The 9905 compounds and 339 structures of the NRLiSt BDB are ready for structure-based and ligand-based virtual screening. In the present study, we detail the protocol used to generate the NRLiSt BDB and its features. We also give some examples of the errors that we found in ChEMBL that convinced us to manually review all original papers. Since extensive and manually curated experimental data about NR ligands and structures are provided in the NRLiSt BDB, it should become a powerful tool to assess the performance of virtual screening methods on NRs, to assist the understanding of NR's function and modulation, and to support the discovery of new drugs targeting NRs. NRLiSt BDB is freely available online at http://nrlist.drugdesign.fr .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.