2011
DOI: 10.1186/1758-2946-3-29
|View full text |Cite
|
Sign up to set email alerts
|

Multiple search methods for similarity-based virtual screening: analysis of search overlap and precision

Abstract: BackgroundData fusion methods are widely used in virtual screening, and make the implicit assumption that the more often a molecule is retrieved in multiple similarity searches, the more likely it is to be active. This paper tests the correctness of this assumption.ResultsSets of 25 searches using either the same reference structure and 25 different similarity measures (similarity fusion) or 25 different reference structures and the same similarity measure (group fusion) show that large numbers of unique molec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
28
0
1

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 29 publications
(30 citation statements)
references
References 31 publications
(39 reference statements)
1
28
0
1
Order By: Relevance
“…Nevertheless, the overall performance of data fusion techniques, particularly using the SUM rule, is superior compared to other methods. This is consistent with previous studies on multiple search methods showing a systematic improvement of compound ranking by applying data fusion techniques 68,69. Comparing results obtained for crystal structures to those for different quality protein models demonstrates a fairly high insensitivity of e FindSite to the structure deformations of target receptors.…”
Section: Resultssupporting
confidence: 90%
“…Nevertheless, the overall performance of data fusion techniques, particularly using the SUM rule, is superior compared to other methods. This is consistent with previous studies on multiple search methods showing a systematic improvement of compound ranking by applying data fusion techniques 68,69. Comparing results obtained for crystal structures to those for different quality protein models demonstrates a fairly high insensitivity of e FindSite to the structure deformations of target receptors.…”
Section: Resultssupporting
confidence: 90%
“…It might thus be advisable to select one method from each group for similarity searching and compare ranked results lists, e.g. by data fusion 23. We wish to point out that the grouping of methods depicted in Figure 3 should be treated with caution, as the dendrograms are likely to vary for other reference data sets and chemotype/target coverage.…”
mentioning
confidence: 99%
“…A more successful analytic study was reported by Holliday et al. [45] They demonstrated that a power law distribution could be used to predict to a fair degree of accuracy the numbers of database structures common to one similarity search, to two similarity searches, to three etc. They also demonstrated that the proportion of actives increased rapidly in these sets of common structures: the probability of activity of a database structure hence increases in line with its frequency of retrieval in multiple similarity searches, thus providing a simple empirical justification for the use of fusion methods in virtual screening.…”
Section: Data Fusionmentioning
confidence: 99%