2006
DOI: 10.1021/ci060138m
|View full text |Cite
|
Sign up to set email alerts
|

The Pharmacophore Kernel for Virtual Screening with Support Vector Machines

Abstract: We introduce a family of positive definite kernels specifically optimized for the manipulation of 3D structures of molecules with kernel methods. The kernels are based on the comparison of the three-points pharmacophores present in the 3D structures of molecules, a set of molecular features known to be particularly relevant for virtual screening applications. We present a computationally demanding exact implementation of these kernels, as well as fast approximations related to the classical fingerprint-based a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
80
1

Year Published

2007
2007
2018
2018

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 76 publications
(83 citation statements)
references
References 36 publications
(98 reference statements)
0
80
1
Order By: Relevance
“…In our experience, these more complex kernels applied to predicted structures are still outperformed by the best 2D kernels, as shown in several additional experiments that were carried on the four data sets used by Mahé et al 23 for classification problems (Table 7). One exception is the alkane boiling point data set where the 3D contact histogram kernels yield the best predictions.…”
Section: Discussionmentioning
confidence: 74%
“…In our experience, these more complex kernels applied to predicted structures are still outperformed by the best 2D kernels, as shown in several additional experiments that were carried on the four data sets used by Mahé et al 23 for classification problems (Table 7). One exception is the alkane boiling point data set where the 3D contact histogram kernels yield the best predictions.…”
Section: Discussionmentioning
confidence: 74%
“…For instance, in the case of k ) 3, we can use a pharmacophore representation, whereby a molecule is represented by the list of all of its triplets of atoms (or even groups of atoms), with the pairwise distances between the pairs of atoms in each triplet, and the corresponding labels, which, beyond atom type, can include information about size, polarity, electronegativity, and so forth. This approach is the same as the one recently described in Mahé et al, 23,24 where the authors use a labeling scheme based on the Morgan indices 25,26 that increase the specificity of the labels by including topological information about adjacent atoms. If desirable, a more compact representation is derived by building histograms for each class of triplets (e.g., C-C-C, C-C-O) on the basis of the size of the smallest sphere that contains all three points (or the largest pairwise distance in the triplet).…”
Section: D Kernels Based On Atomic Coordinates and Pharmacophoresmentioning
confidence: 99%
“…Actually, in some domains where the number and/or the diversity of the available examples are limited, as in the domain of chemometry [12], one might learn average properties, these might do well on the test set, and still be poorly related to the target concept; some evidence for the possibility of such a phenomenon was presented in [1], where the test error could be 2% or lower although the concept learned was a gross overgeneralization of the true target concept.…”
Section: Discussion and Perspectivesmentioning
confidence: 99%
“…Besides early approaches [4], specific kernels were designed for MIP problems [5,3,12,11]. The basic idea is to define the kernel K of two bags of instances as the average of the kernels k between their instances:…”
Section: State Of the Artmentioning
confidence: 99%
See 1 more Smart Citation