2005
DOI: 10.1021/ci050039t
|View full text |Cite
|
Sign up to set email alerts
|

Graph Kernels for Molecular Structure−Activity Relationship Analysis with Support Vector Machines

Abstract: The support vector machine algorithm together with graph kernel functions has recently been introduced to model structure-activity relationships (SAR) of molecules from their 2D structure, without the need for explicit molecular descriptor computation. We propose two extensions to this approach with the double goal to reduce the computational burden associated with the model and to enhance its predictive accuracy: description of the molecules by a Morgan index process and definition of a second-order Markov mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
143
0

Year Published

2006
2006
2017
2017

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 161 publications
(144 citation statements)
references
References 21 publications
(72 reference statements)
0
143
0
Order By: Relevance
“…This kernel is defined as the Tanimoto coefficient between fingerprints indicating the presence or absence of all possible molecular fragments of length up to 8 in the 2D structure of the molecule, where a fragment refers to a sequence of atoms connected by covalent bonds. We note that this fingerprint is similar to classical 2D-fingerprints such as the Daylight representation 4 , with the difference that our implementation does not require to fold the fingerprint into a small-size vector (18).…”
Section: Methodsmentioning
confidence: 92%
See 1 more Smart Citation
“…This kernel is defined as the Tanimoto coefficient between fingerprints indicating the presence or absence of all possible molecular fragments of length up to 8 in the 2D structure of the molecule, where a fragment refers to a sequence of atoms connected by covalent bonds. We note that this fingerprint is similar to classical 2D-fingerprints such as the Daylight representation 4 , with the difference that our implementation does not require to fold the fingerprint into a small-size vector (18).…”
Section: Methodsmentioning
confidence: 92%
“…We describe this algorithm in the case of the three-points kernels (15), (16) and (17), its extension to the two-points kernels (18), (19) and (20) being straightfoward. Following the notation of Section 5, we represent molecules by complete, atom-based labeled graphs, with the difference that the set of atom labels L defining the vertices labels is considered to be discrete (e.g., the atom types), and the edges are now labeled by the bin index of the corresponding inter-atomic distance.…”
Section: Definition 8 (Two-points Tanimoto Kernel)mentioning
confidence: 99%
“…Paths are allowed to self-intersect and traverse the same node twice, to capture ring structures, but are not allowed to traverse the same edge twice, to avoid "totters". 25 2.4. 2.5D Surface and 3D Kernels Based on Delaunay Tetrahedrizations.…”
Section: Kernel Methodsmentioning
confidence: 99%
“…When data are graphs, such as when we represent a molecular compound by its 2D planar representation, a possible kernel is based on the comparison of the walks in the graphs. [10,7,14] Another interesting class of kernels is the set of multitask kernels, useful when we want to fit different models by sharing information across the tasks. [6] More precisely, suppose we want to fit a model of the form f(x,y) where x is a cell line and y is a chemical, considered as a task in the sense that we want to fit a model specific to each chemical to predict its toxicity across cell lines.…”
Section: Kernels and Multitask Learningmentioning
confidence: 99%