2014
DOI: 10.1021/ci500344v
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking Study of Parameter Variation When Using Signature Fingerprints Together with Support Vector Machines

Abstract: QSAR modeling using molecular signatures and support vector machines with a radial basis function is increasingly used for virtual screening in the drug discovery field. This method has three free parameters: C, γ, and signature height. C is a penalty parameter that limits overfitting, γ controls the width of the radial basis function kernel, and the signature height determines how much of the molecule is described by each atom signature. Determination of optimal values for these parameters is time-consuming. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
32
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 35 publications
(35 citation statements)
references
References 39 publications
1
32
0
Order By: Relevance
“…Scikit-learn implementation of SVM is based on the Libsvm and the Liblinear libraries [66, 67]. When optimizing the SVM using the non-linear RBF kernel the values for hyper-parameters gamma ( γ ) and Cost ( C ) where selected with similar range to those reported by Alvarsson et al [68]. Here values for gamma ( γ ) tested were 10e−1, 10e−2, 10e−3, 10e−4, 10e−5 and for cost ( C ) 1, 10, 100, 1000.…”
Section: Methodsmentioning
confidence: 99%
“…Scikit-learn implementation of SVM is based on the Libsvm and the Liblinear libraries [66, 67]. When optimizing the SVM using the non-linear RBF kernel the values for hyper-parameters gamma ( γ ) and Cost ( C ) where selected with similar range to those reported by Alvarsson et al [68]. Here values for gamma ( γ ) tested were 10e−1, 10e−2, 10e−3, 10e−4, 10e−5 and for cost ( C ) 1, 10, 100, 1000.…”
Section: Methodsmentioning
confidence: 99%
“…The two major categories of vHTS approaches are molecular simulation (e.g., AutoDock, DOCK, Flex, AMBER, GROMACS, CHARMM), and ligand‐based scoring . Molecular simulation uses models and concepts from physics, chemistry, and mathematics to make optimized ligand/substrate interaction predictions.…”
Section: Introductionmentioning
confidence: 99%
“…The two major categories of vHTS approaches are molecular simulation (e.g., AutoDock, DOCK, Flex, AMBER, GRO-MACS, CHARMM), [41][42][43][44][45][46][47][48][49][50][51][52] and ligand-based scoring. 49,[53][54][55] Molecular simulation uses models and concepts from physics, chemistry, and mathematics to make optimized ligand/substrate interaction predictions. Although molecular simulations require minimal experimental data, it is computationally expensive and requires numerical solution convergence for confidence.…”
Section: Introductionmentioning
confidence: 99%
“…Often, the QSAR model algorithms come with free parameters that need to be determined, e.g., support vector machines based on the radial basis function has the free parameters and cost  [47] and k-nearest neighbour has k  [46]. A common way of determining actual values for parameters such as these is a grid search or “parameter sweep”.…”
Section: Methodsmentioning
confidence: 99%