2018
DOI: 10.3390/molecules23112756
|View full text |Cite
|
Sign up to set email alerts
|

Study of the Applicability Domain of the QSAR Classification Models by Means of the Rivality and Modelability Indexes

Abstract: The reliability of a QSAR classification model depends on its capacity to achieve confident predictions of new compounds not considered in the building of the model. The results of this external validation process show the applicability domain (AD) of the QSAR model and, therefore, the robustness of the model to predict the property/activity of new molecules. In this paper we propose the use of the rivality and modelability indexes for the study of the characteristics of the datasets to be correctly modeled by… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(15 citation statements)
references
References 26 publications
0
15
0
Order By: Relevance
“…The concept of evaluating the predictability or modelability, essentially based on distance and similarity measures, has been developed in several studies [ 23 , 24 , 25 , 26 ]. The distance or similarity between molecules was estimated from the predictions of activities done by previously developed QSAR models.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The concept of evaluating the predictability or modelability, essentially based on distance and similarity measures, has been developed in several studies [ 23 , 24 , 25 , 26 ]. The distance or similarity between molecules was estimated from the predictions of activities done by previously developed QSAR models.…”
Section: Introductionmentioning
confidence: 99%
“…Then, the modelability index for classification QSAR endpoint is defined as the ratio of compounds having the first nearest neighbor in the same class to the total number of compounds in the data set [ 24 ]. Later, also analogous modelability index based on the Euclidean distance measured between compounds in feature space and activity prediction by classification QSAR models was introduced for classification endpoints [ 25 ]. An alternative and conceptually simpler method for estimating modelability is the one used by Thomas et al [ 26 ] which is based only on the consideration of predictive capabilities of models comparing with the gain of the model over the level of random (chance) accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…The determination of ADs is therefore of great importance as it is explicitly requested in the validation processes put in place at the OECD level [ 39 ].…”
Section: Resultsmentioning
confidence: 99%
“…The reliability of a QSAR model depends on its capacity to achieve confident predictions of new compounds not considered in the model development, which implies the necessity of considering the applicability domain (AD) of the model. Several AD analyses including the descriptor’s range, leverages, Euclidian distances, rivality and modelability indexes [ 66 , 67 , 68 ] are available. In this study we used two of AD approaches; maximal Euclidian distance of training set in NN-C (EDcrt = 6.16) and NN-D model (ED crt = 2.36), and leverage values in Q-D model (hat* = 0.4) ( Figure 4 ) [ 67 , 68 ].…”
Section: Resultsmentioning
confidence: 99%