2004
DOI: 10.1021/ci049869h
|View full text |Cite
|
Sign up to set email alerts
|

Effect of Molecular Descriptor Feature Selection in Support Vector Machine Classification of Pharmacokinetic and Toxicological Properties of Chemical Agents

Abstract: Statistical-learning methods have been developed for facilitating the prediction of pharmacokinetic and toxicological properties of chemical agents. These methods employ a variety of molecular descriptors to characterize structural and physicochemical properties of molecules. Some of these descriptors are specifically designed for the study of a particular type of properties or agents, and their use for other properties or agents might generate noise and affect the prediction accuracy of a statistical learning… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
182
0

Year Published

2006
2006
2012
2012

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 153 publications
(187 citation statements)
references
References 75 publications
(189 reference statements)
5
182
0
Order By: Relevance
“…The choice of 50% was arbitrary although it has also been used in previous studies (Niwa 2003). There have been a number of different cutoffs used, from 10% (Palm et al 1997) up to 70% (Xue et al 2004), with no standard defined. Table 4, referring to models built from training set TS1, shows that for the classification of the validation set the best overall classification accuracy was 0.958 (481/502), the highest specificity value was 0.952 (441/460) and the best sensitivity was 0.959 (40/42), all using model 3.…”
Section: Classification Analysismentioning
confidence: 99%
“…The choice of 50% was arbitrary although it has also been used in previous studies (Niwa 2003). There have been a number of different cutoffs used, from 10% (Palm et al 1997) up to 70% (Xue et al 2004), with no standard defined. Table 4, referring to models built from training set TS1, shows that for the classification of the validation set the best overall classification accuracy was 0.958 (481/502), the highest specificity value was 0.952 (441/460) and the best sensitivity was 0.959 (40/42), all using model 3.…”
Section: Classification Analysismentioning
confidence: 99%
“…Area under ROC curve was found to be 0.967, whereas Youden's index was calculated as 0.84. Quite a few researchers have been tried to generate absorption models using different machine learning approaches and reported good results [6,24]. This is the first time we are presenting a comparative study between three potential machine learning approaches viz.…”
Section: Resultsmentioning
confidence: 95%
“…59 The problem of selecting properties which are responsible for given outputs occurs in various machine learning applications. [60][61][62] We use feature selection methods with the objective to detect features that are responsible for the underlying class structure. In addition, we search for feature combinations that reflect or even outperform results using all features.…”
Section: Machine Learning Techniquesmentioning
confidence: 99%