2003
DOI: 10.1016/s0169-7439(03)00054-6
|View full text |Cite
|
Sign up to set email alerts
|

Clustering and similarity of chemical structures represented by binary substructure descriptors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2004
2004
2012
2012

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 19 publications
(10 citation statements)
references
References 20 publications
0
10
0
Order By: Relevance
“…Classification of the presence or absence of substructures based on mass spectral data has found great interest and has been suggested as a tool in systematic structure elucidation of organic molecules (Chapman, 1993;Klawun & Wilkins, 1996;Varmuza & Werther, 1996;Varmuza, 2000). Recently, exploratory data analysis methods have proven to be useful in the evaluation of database search results or spectra similarity hitlists, as well as for the recognition of spectra-structure relationships Scsibrany et al, 2003).…”
Section: A Overviewmentioning
confidence: 99%
“…Classification of the presence or absence of substructures based on mass spectral data has found great interest and has been suggested as a tool in systematic structure elucidation of organic molecules (Chapman, 1993;Klawun & Wilkins, 1996;Varmuza & Werther, 1996;Varmuza, 2000). Recently, exploratory data analysis methods have proven to be useful in the evaluation of database search results or spectra similarity hitlists, as well as for the recognition of spectra-structure relationships Scsibrany et al, 2003).…”
Section: A Overviewmentioning
confidence: 99%
“…The Ames database was created from six different public sources [ 40 , 47 ] and each chemical structure possesses a class label (0 and 1) that results from the Ames test indicating the genetoxicity of a substance. By starting from the original database Ames mutagenicity [ 40 , 47 ] containing 6512 chemical compounds, we created AG 3982 by filtering out isomorphic graphs based on the software SubMat [ 86 ]. Finally, this procedure resulted in 3982 structurally different skeletons, that is, all atoms and all bonds are considered as equal.…”
Section: Resultsmentioning
confidence: 99%
“…Further, topological descriptors have often been combined with other techniques from statistical data analysis, e.g., clustering methods [ 26 , 48 ] to infer correlations between the used indices. Besides using topological descriptors for characterizing chemical graphs [ 27 , 32 , 49 ], they have also been applied to quantify the structural similarity of chemicals representing networks [ 50 , 51 ]. Among the large number of existing topological indices, an important class of such measures relies on S HANNON 's entropy to characterize graphs by determining their structural information content [ 27 , 52 - 54 ].…”
Section: Introductionmentioning
confidence: 99%
“…Using this matrix classical clustering algorithms can be applied [70][71][72], or also, previously reducing this representational space by applying a PCA over the n-dimensional descriptors representation [73].…”
Section: Pattern Representation Of Chemicals Structuresmentioning
confidence: 99%