Multiple search methods for similarity-based virtual screening: analysis of search overlap and precision

Holliday, John D.; Kanoulas, Evangelos; Malim, Nurul Hashimah Ahamed Hassain; Willett, Peter

doi:10.1186/1758-2946-3-29

Cited by 29 publications

(30 citation statements)

References 31 publications

(39 reference statements)


“…Nevertheless, the overall performance of data fusion techniques, particularly using the SUM rule, is superior compared to other methods. This is consistent with previous studies on multiple search methods showing a systematic improvement of compound ranking by applying data fusion techniques [68,69]. Comparing results obtained for crystal structures to those for different quality protein models demonstrates a fairly high insensitivity of eFindSite to the structure deformations of target receptors.…”

Section: Results

supporting

confidence: 90%

eFindSite: Enhanced Fingerprint‐Based Virtual Screening Against Predicted Ligand Binding Sites in Protein Models

Feinstein

Bryliński

2014

Molecular Informatics


A standard practice for lead identification in drug discovery is ligand virtual screening, which utilizes computing technologies to detect small compounds that likely bind to target proteins prior to experimental screens. A high accuracy is often achieved when the target protein has a resolved crystal structure; however, using protein models still renders significant challenges. Towards this goal, we recently developed eFindSite that predicts ligand binding sites using a collection of effective algorithms, including meta-threading, machine learning and reliable confidence estimation systems. Here, we incorporate fingerprint-based virtual screening capabilities in eFindSite in addition to its flagship role as a ligand binding pocket predictor. Virtual screening benchmarks using the enhanced Directory of Useful Decoys demonstrate that eFindSite significantly outperforms AutoDock Vina as assessed by several evaluation metrics. Importantly, this holds true regardless of the quality of target protein structures. As a first genome-wide application of eFindSite, we conduct large-scale virtual screening of the entire proteome of Escherichia coli with encouraging results. In the new approach to fingerprint-based virtual screening using remote protein homology, eFindSite demonstrates its compelling proficiency offering a high ranking accuracy and low susceptibility to target structure deformations. The enhanced version of eFindSite is freely available to the academic community at http://www.brylinski.org/efindsite.


“…It might thus be advisable to select one method from each group for similarity searching and compare ranked results lists, e.g. by data fusion [23]. We wish to point out that the grouping of methods depicted in Figure 3 should be treated with caution, as the dendrograms are likely to vary for other reference data sets and chemotype/target coverage.…”

mentioning

confidence: 99%

Chemically Advanced Template Search (CATS) for Scaffold‐Hopping and Prospective Target Prediction for ‘Orphan’ Molecules

Reutlinger

Koch

Reker

et al.

2013

Molecular Informatics



“…A more successful analytic study was reported by Holliday et al. [45]. They demonstrated that a power law distribution could be used to predict to a fair degree of accuracy the numbers of database structures common to one similarity search, to two similarity searches, to three etc. They also demonstrated that the proportion of actives increased rapidly in these sets of common structures: the probability of activity of a database structure hence increases in line with its frequency of retrieval in multiple similarity searches, thus providing a simple empirical justification for the use of fusion methods in virtual screening.…”

Section: Data Fusion

mentioning

confidence: 99%

Chemoinformatics at the University of Sheffield 2002–2014

Gillet

Holliday

Willett

2015

Molecular Informatics

Self Cite
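
The relationship described in the citation statement above (a compound retrieved by more of the similarity searches is more likely to be active) can be illustrated with a minimal Python sketch. It is not the procedure used by Holliday et al.; the function names, the `top_k` cutoff, and the toy compound IDs and activity labels are all assumptions invented for illustration.

```python
from collections import Counter


def retrieval_frequency(result_lists, top_k):
    """Count, for each compound ID, how many searches return it in their top_k."""
    counts = Counter()
    for ranked in result_lists:
        # set() guards against an ID appearing twice in one ranked list
        counts.update(set(ranked[:top_k]))
    return counts


def precision_by_frequency(counts, actives):
    """Map each retrieval frequency to the fraction of compounds at that frequency that are active."""
    totals, hits = Counter(), Counter()
    for compound, freq in counts.items():
        totals[freq] += 1
        if compound in actives:
            hits[freq] += 1
    return {freq: hits[freq] / totals[freq] for freq in sorted(totals)}


# Hypothetical toy data: three ranked hit lists and a set of known actives.
searches = [
    ["a", "b", "c", "d"],
    ["a", "c", "e", "f"],
    ["a", "b", "g", "c"],
]
actives = {"a", "c"}

counts = retrieval_frequency(searches, top_k=3)
precision = precision_by_frequency(counts, actives)
print(precision)
```

In this toy case the fraction of actives rises with retrieval frequency, which is the pattern the statement reports; real screening data would need far larger result lists to show the trend reliably.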
