Analysis and use of fragment-occurrence data in similarity-based virtual screening

Arif, Shereena M.; Holliday, John D.; Willett, Peter

doi:10.1007/s10822-009-9285-0

Cited by 27 publications

(26 citation statements)

References 52 publications

(57 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As another example, Duan et al note that fingerprints can often be implemented in multiple ways, with their extensive comparison of similarity methods for virtual screening involving 11 different parameterisations of the atoms involved in each substructural fragment encoded in a fingerprint [22]; the comparison here has used two popular representations (ECFP4 and FCFP4) in the Pipeline Pilot software to exemplify the use of alternative approaches to atom-typing. Other factors that may affect the effectiveness of fingerprint implementations include: the length of the fingerprint that is used, especially if hashing techniques are employed that can result in substantial numbers of collisions [21]; and whether incidence or occurrence data is used, i.e., whether the fingerprint encodes merely the presence of a fragment, its frequency of occurrence, or some standardised form of the latter [24].…”

Section: Methodsmentioning

confidence: 99%

Effectiveness of 2D Fingerprints for Scaffold Hopping

et al. 2011

Self Cite

View full text Add to dashboard Cite

Universities of LeedsMethods. This paper reports a detailed evaluation of the effectiveness of six common types of 2D fingerprint when they are used for scaffold hopping similarity searches of MDDR, WOMBAT and MUV data.Results. The results demonstrate that 2D fingerprints can be used for scaffold hopping, with novel scaffolds being identified in nearly every search that was carried out. The degree of enrichment depends on the structural diversity of the actives that are being sought, with the greatest enrichments often being obtained using ECFP4 fingerprints. Conclusions. 2D fingerprints provide a simple, and computationally efficient, way of identifying novel chemotypes in lead-discovery programmes.

show abstract

Section: Methodsmentioning

confidence: 99%

Effectiveness of 2D Fingerprints for Scaffold Hopping

et al. 2011

Self Cite

View full text Add to dashboard Cite

show abstract

“…In conventional, unweighted fingerprints x i =1 experiments using sets of bioactive molecules from the MDL Drug Data Report (MDDR) and World of Molecular Bioactivity (WOMBAT) databases. [23][24] The first type of weighting, frequency weighting, is based on the assumption that a fragment that occurs several times in a molecule should make a greater contribution to the overall degree of similarity than if it occurs just once, and that this contribution should be still greater if that fragment also occurs multiple times in the molecule with which it is being compared. Arif et al considered several different ways of using the occurrence information, as detailed in the left-hand side of Table 1, and concluded that the best screening results were obtained by using the square root of the occurrence frequencies.…”

Section: Similarity-based Virtual Screeningmentioning

confidence: 99%

“…Arif et al considered several different ways of using the occurrence information, as detailed in the left-hand side of Table 1, and concluded that the best screening results were obtained by using the square root of the occurrence frequencies. [23] The effect of this scheme is to lessen the contribution of the more generic fragments that can occur relatively frequently within molecules, and that can thus yield high values if raw occurrence counts are used without some form of normalisation. Turning to the second type of weighting, inverse frequency weighting, the basic assumption here is that two molecules that share an infrequently occurring feature (such as a rare heterocycle) should be considered as being more similar to each other than if they share a feature (such as a benzene ring) that occurs very frequently throughout the database that is being searched.…”

Section: Similarity-based Virtual Screeningmentioning

confidence: 99%

Chemoinformatics at the University of Sheffield 2002–2014

Gillet

Holliday

Willett

2015

Molecular Informatics

Self Cite

View full text Add to dashboard Cite

show abstract

“…However, we are going to use only the MDDR database in our experiment. The data is said to be qualitative in MDDR database and a molecule in it is said to be inactive if it is not in the case, whereby the molecule is not exhibiting any specific activity [17].…”

Section: A Chemical Databasementioning

confidence: 99%

New strategy for Turbo Similarity Searching: Implementation and testing

Malim

Pei-Chia

Arif

2013

2013 International Conference on Advanced Computer Science and Information Systems (ICACSIS)

View full text Add to dashboard Cite

Virtual screening is one of the most vital methods applied in Chemoinformatics, the field that contributes to drug discovery process. Turbo Similarity Searching (TSS) and data fusion are two of the latest chemical similarity searching strategies, which has evolved from the conventional similarity searching (SS) that apply the concept of multi-target searching instead of just an individual target search. The indirect relationship exists in TSS, with the inclusion of Nearest Neighbours (NN) has been proven to have better performance than the direct relationship (i.e. between query structure and database structures) that exists in similarity searching process. In this paper, we will focus on the implementation and improvement of the existing TSS. By adding in another layer of indirect relationship between the reference compound and the database compounds, along with an additional fusion layer, the performance of the new TSS strategy can be observed. The initial results indicated that there is an obvious increment in the recall value when applying the new strategy. The results are also evaluated with the significance test to show that the result produced by the new strategy is true and does not occurred by chance. Further work on different activity classes and different descriptors on the new strategy are expected to generate a better performance than the existing TSS.

show abstract

Analysis and use of fragment-occurrence data in similarity-based virtual screening

Cited by 27 publications

References 52 publications

Effectiveness of 2D Fingerprints for Scaffold Hopping

Effectiveness of 2D Fingerprints for Scaffold Hopping

Chemoinformatics at the University of Sheffield 2002–2014

New strategy for Turbo Similarity Searching: Implementation and testing

Contact Info

Product

Resources

About