2009
DOI: 10.1007/s11030-009-9108-1
|View full text |Cite
|
Sign up to set email alerts
|

Three-class classification models of logS and logP derived by using GA–CG–SVM approach

Abstract: In this investigation, three-class classification models of aqueous solubility (logS) and lipophilicity (logP) have been developed by using a support vector machine (SVM) method combined with a genetic algorithm (GA) for feature selection and a conjugate gradient method (CG) for parameter optimization. A 5-fold cross-validation and an independent test set method were used to evaluate the SVM classification models. For logS, the overall prediction accuracy is 87.1% for training set and 90.0% for test set. For l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2010
2010
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(20 citation statements)
references
References 32 publications
0
20
0
Order By: Relevance
“…Molecular fingerprints are widely applied in substructure/similarity searching,( 35 ) compound clustering,( 36 ) and classification. ( 22 ) In this study, considering both their popularity and public availability, we adopted the MDL MACCS key( 37 ) and the PubChem fingerprint. ( 38 ) The MACCS key is a binary vector of 166 structural and/or physicochemical features (MACCS166), while the PubChem fingerprint represents the presence/absence of 881 substructures (PC881).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Molecular fingerprints are widely applied in substructure/similarity searching,( 35 ) compound clustering,( 36 ) and classification. ( 22 ) In this study, considering both their popularity and public availability, we adopted the MDL MACCS key( 37 ) and the PubChem fingerprint. ( 38 ) The MACCS key is a binary vector of 166 structural and/or physicochemical features (MACCS166), while the PubChem fingerprint represents the presence/absence of 881 substructures (PC881).…”
Section: Methodsmentioning
confidence: 99%
“…We also considered one additional fingerprint consisting of six physicochemical properties (ADD6), which were previously found to be relevant to solubility modeling. 22 , 39 With respect to these physicochemical properties, data sets I and II are rather diverse (Figure 1 ). Regardless of the minimal and maximal values, both data sets have similar distributions with respect to most of these properties.…”
Section: Methodsmentioning
confidence: 99%
“…Researchers have adopted ML‐driven approaches to predict some key physicochemical properties, such as water solubility, membrane permeability, and lipophilicity. We provide a detailed description of each property and discuss the ML‐based techniques that specifically predict the water solubility, 206–210 membrane permeability, 211–213 and lipophilicity 214–219 in the Supporting Information. Although improved ML models have led to better prediction of molecular properties, the lack of standard criteria for performance evaluation has limited the progress.…”
Section: Ai/ml Applications In Drug Discoverymentioning
confidence: 99%
“…In this study, kernel RBF is used to LSSVM classification because its ability to handled the high dimension data [2] and produce a good performance [12].…”
Section: Training Lssvm Using the Optimal Parametermentioning
confidence: 99%