1995
DOI: 10.1007/bf00117274
|View full text |Cite
|
Sign up to set email alerts
|

New molecular shape descriptors: Application in database screening

Abstract: Geometric descriptors are becoming popular tools for encoding molecular shape, for use in database screening and clustering calculations. They provide condensed representations of complex objects and, as a consequence, can usually be compared quite rapidly. Here we present a number of new descriptors and methods for the quantification of molecular shape similarity. The techniques are tested using two different biological systems, with particular emphasis on their potential utility as methods for prescreening s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
31
0

Year Published

1995
1995
2007
2007

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 64 publications
(31 citation statements)
references
References 21 publications
(28 reference statements)
0
31
0
Order By: Relevance
“…However, histogram calculation has a number of well-known drawbacks, specially for very large databases, such as the difficulty of selecting a bin size suitable for all molecules in the database along with the requirement of relatively large storage and computing power. [16][17][18] These difficulties are circumvented by calculating instead the first moments of each of the four distributions of atomic distances in order to characterize them as a way to encode the molecular shape. Such approach is based on a theorem 30 from statistics, which proves that a distribution is completely determined by its moments.…”
Section: An Ultrafast Methods For Shape Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…However, histogram calculation has a number of well-known drawbacks, specially for very large databases, such as the difficulty of selecting a bin size suitable for all molecules in the database along with the requirement of relatively large storage and computing power. [16][17][18] These difficulties are circumvented by calculating instead the first moments of each of the four distributions of atomic distances in order to characterize them as a way to encode the molecular shape. Such approach is based on a theorem 30 from statistics, which proves that a distribution is completely determined by its moments.…”
Section: An Ultrafast Methods For Shape Recognitionmentioning
confidence: 99%
“…19), which are therefore not intended to describe exclusively molecular shape and thus are outside the scope of this article. These techniques are fast (in the region of 500-2 000 comparisons per second 18 on a 1995 PC), but they are known to be less effective than superposition methods, 20 and thus they are normally used for database prescreening as suggested by their authors 18 (i.e. quickly filtering molecules with very different shapes) instead of stand-alone molecular shape comparison.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…An example of the nearest neighbours obtained in a Tanimoto-based similarity search is shown in Figure 5, where the close relationship to the target structure is clearly evident. Later work in Sheffield [64] evaluated the inter-atomic distance screens that are used for 3D substructure searching when applied to the calculation of 3D structural similarities, and this occasioned several subsequent discussions of the use of both distance-based and angle-based methods for 3D similarity searching (see, e.g., [65][66][67]). However, most of the research, both in Sheffield and elsewhere, of 3D similarity measures has focused on two alternative approaches: the use of the maximum common substructure (MCS) and of molecular field overlaps.…”
Section: Similarity Searchingmentioning
confidence: 99%
“…Many examples of 3D geometric descriptors have been given in the literature [1][2][3][5][6][7][8][9]11]. Pepperell and Willett considered a number of 3D similarity measures involving interatomic distance measures for the purpose of database clustering [3,5].…”
Section: Introductionmentioning
confidence: 99%