Computers in chemistryComputers in chemistry V 0380 Virtual Screening Using Protein-Ligand Docking: Avoiding Artificial Enrichment. -(VERDONK*, M. L.; BERDINI, V.; HARTSHORN, M. J.; MOOIJ, W. T. M.; MURRAY, C. W.; TAYLOR, R. D.; WATSON, P.; J. Chem. Inf. Comput. Sci. 44 (2004) 3, 793-806; Astex
This study addresses a number of topical issues around the use of protein-ligand docking in virtual screening. We show that, for the validation of such methods, it is key to use focused libraries (containing compounds with one-dimensional properties, similar to the actives), rather than "random" or "drug-like" libraries to test the actives against. We also show that, to obtain good enrichments, the docking program needs to produce reliable binding modes. We demonstrate how pharmacophores can be used to guide the dockings and improve enrichments, and we compare the performance of three consensus-ranking protocols against ranking based on individual scoring functions. Finally, we show that protein-ligand docking can be an effective aid in the screening for weak, fragment-like binders, which has rapidly become a popular strategy for hit identification. All results presented are based on carefully constructed virtual screening experiments against four targets, using the protein-ligand docking program GOLD.
A naïve Bayes classifier, employed in conjunction with 2D pharmacophore feature triplet vectors describing the molecules, is presented and validated. Molecules are described using a vector where each element in the vector contains the number of times a particular triplet of atom-based features separated by a set of topological distances occurs. Using the feature triplet vectors it is possible to generate naïve Bayes classifiers that predict whether molecules are likely to be active against a given target (or family of targets). Two retrospective validation experiments were performed using a range of actives from WOMBAT, the Prous Integrity database, and the Arena screening library. The performance of the classifiers was evaluated using enrichment curves, enrichment factors, and the BEDROC metric. The classifiers were found to give significant enrichments for the various test sets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.