2009
DOI: 10.3390/molecules14051660
|View full text |Cite|
|
Sign up to set email alerts
|

On Two Novel Parameters for Validation of Predictive QSAR Models

Abstract: Validation is a crucial aspect of quantitative structure–activity relationship (QSAR) modeling. The present paper shows that traditionally used validation parameters (leave-one-out Q2 for internal validation and predictive R2 for external validation) may be supplemented with two novel parameters rm2 and Rp2 for a stricter test of validation. The parameter rm2(overall) penalizes a model for large differences between observed and predicted values of the compounds of the whole set (considering both training and t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
97
0
3

Year Published

2011
2011
2020
2020

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 477 publications
(103 citation statements)
references
References 45 publications
(61 reference statements)
0
97
0
3
Order By: Relevance
“…Y vector randomization 25 times yielded poor Q 2 and R 2 values in our case. R 2 p based on Y-randomization was also applied to the generated model which applied a penalty to the models' coefficient of variance for the dissimilarity between mean coefficient of determination (R 2 r ) of randomized models and coefficient of determination (R 2 ) of the non-randomized model [31]. In order to have a superior predictive potential of the model, a modified R 2 (R 2 m ) was determined [31].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Y vector randomization 25 times yielded poor Q 2 and R 2 values in our case. R 2 p based on Y-randomization was also applied to the generated model which applied a penalty to the models' coefficient of variance for the dissimilarity between mean coefficient of determination (R 2 r ) of randomized models and coefficient of determination (R 2 ) of the non-randomized model [31]. In order to have a superior predictive potential of the model, a modified R 2 (R 2 m ) was determined [31].…”
Section: Methodsmentioning
confidence: 99%
“…R 2 p based on Y-randomization was also applied to the generated model which applied a penalty to the models' coefficient of variance for the dissimilarity between mean coefficient of determination (R 2 r ) of randomized models and coefficient of determination (R 2 ) of the non-randomized model [31]. In order to have a superior predictive potential of the model, a modified R 2 (R 2 m ) was determined [31]. Further, the model was externally validated by predicting the activity of external set or test set of compounds and evaluated by means of coefficient of determination (R 2 pred ), standard error of external prediction (SEP), and the average relative error (ARE pred ) [32].…”
Section: Methodsmentioning
confidence: 99%
“…Octanol/water partition coefficients and related maximum residue limits (MRL) in fat tissue LogP  arithmetic mean of octanol/water partition coefficient; LogD7.4  arithmetic mean of octanol/buffer pH 7.4 partition coefficient; CLogD7.4 -octanol/buffer pH 7.4 partition coefficient (MarvinSketch algorithm); CLogP -octanol/water partition coefficient (MarvinSketch algorithm); XLogP -octanol/water partition coefficient (ALOGPS algorithm, PubMed); LogKow -octanol/water partition coefficient (EPI Suite TM algorithm) (16) ) of the observed versus predicted data was obtained. The leave-one-out (LOO) method was used for model cross-validation (14,27). Squared cross-validated correlation coefficient (Q 2 ) parameter and differences between Q 2 and R 2 were calculated as a measure of the internal performance and model predictive ability.…”
Section: Methodsmentioning
confidence: 99%
“…A parameter R 2 p was introduced to evaluate the Y-randomization results. The parameter penalized the model for the difference between squared mean correlation coefficient (R 2 r ) of randomized models and squared correlation coefficient (R 2 ) of the non-randomized model [55] and was calculated by the following equation:…”
Section: Comsia Models Generationmentioning
confidence: 99%
“…To further validate the predictive ability of the model, a modified r 2 (r 2 m (overall) ) was introduced as the following equation [55]:…”
Section: Comsia Models Generationmentioning
confidence: 99%