Davide Ballabio scite author profile

Abstract: One of the OECD principles for model validation requires defining the Applicability Domain (AD) of QSAR models. This is important because reliable predictions are generally limited to query chemicals that are structurally similar to the training compounds used to build the model. Characterizing the interpolation space is therefore central to defining the AD. In this study, some existing descriptor-based approaches to this task are discussed and compared by implementing them on validated datasets from the literature. The algorithms adopted by the different approaches define the interpolation space in several ways, while the chosen thresholds contribute significantly to the extrapolations. For each dataset and approach implemented in this study, the comparison was carried out by considering the model statistics and the relative position of the test set with respect to the training space.
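As an illustration of what a descriptor-based interpolation space can look like, the sketch below uses a simple range-based (bounding box) check. This is an assumption chosen for brevity: the abstract does not name the approaches that were compared, and the function names and example descriptor values here are hypothetical.

```python
def bounding_box(train_descriptors):
    """Return (min, max) per descriptor column of the training set."""
    columns = list(zip(*train_descriptors))
    return [(min(col), max(col)) for col in columns]


def inside_domain(query, box):
    """True if every descriptor of the query lies within the training range."""
    return all(low <= value <= high for value, (low, high) in zip(query, box))


# Hypothetical training set: three compounds, two descriptors each.
train = [[1.0, 10.0], [2.0, 20.0], [3.0, 15.0]]
box = bounding_box(train)

print(inside_domain([2.5, 12.0], box))  # query within both descriptor ranges
print(inside_domain([3.5, 12.0], box))  # first descriptor exceeds the training maximum
```

In a check of this kind the threshold is implicit in the training ranges; approaches based on distances or leverages would need an explicit cutoff, which is where the choice of threshold starts to influence how many test compounds count as extrapolations.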


Comments on the Definition of the Q² Parameter for QSAR Validation

Consonni

Ballabio

Todeschini

2009

J. Chem. Inf. Model.

491

298


This paper deals with the problem of evaluating the predictive ability of QSAR models and continues the discussion about proper estimates of the predictive ability from an external evaluation set reported in Schüürmann G., Ebert R.-U., et al. External Validation and Prediction Employing the Predictive Squared Correlation Coefficient--Test Set Activity Mean vs Training Set Activity Mean. J. Chem. Inf. Model. 2008, 48, 2140-2145. Schüürmann et al. discussed two formulas for calculating the predictive squared correlation coefficient Q²: the one adopted by the current OECD guidelines on QSAR validation, based on the sum of squares (SS) of the external test set referring to the training set response mean, and another based on the SS of the external test set referring to the test set response mean. In addition to these two formulas, a third is evaluated here, in which the SS of deviations of observed values from the training set mean is computed over the training set instead of the external evaluation set, with both sums expressed as means.
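The three formulas described above can be sketched as follows. This is a minimal illustration rather than the authors' code: the labels F1, F2 and F3 follow common usage in the QSAR literature and do not appear in the abstract, and the function names and example values are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def sum_sq(values, center):
    """Sum of squared deviations of values from a given center."""
    return sum((v - center) ** 2 for v in values)


def q2_variants(y_train, y_ext, y_ext_pred):
    """Return (Q2_F1, Q2_F2, Q2_F3) for an external evaluation set."""
    # Prediction error sum of squares over the external set.
    press = sum((obs - pred) ** 2 for obs, pred in zip(y_ext, y_ext_pred))
    train_mean = mean(y_train)
    ext_mean = mean(y_ext)

    # F1: external-set SS referring to the training set mean (OECD formula).
    q2_f1 = 1 - press / sum_sq(y_ext, train_mean)
    # F2: external-set SS referring to the external set mean.
    q2_f2 = 1 - press / sum_sq(y_ext, ext_mean)
    # F3: mean squared error on the external set relative to the mean
    # squared deviation of the training set from its own mean.
    q2_f3 = 1 - (press / len(y_ext)) / (sum_sq(y_train, train_mean) / len(y_train))
    return q2_f1, q2_f2, q2_f3


# Hypothetical responses, for illustration only.
y_train = [1.0, 2.0, 3.0, 4.0, 5.0]
y_ext = [2.0, 4.0, 6.0]
y_ext_pred = [2.5, 3.5, 6.5]

f1, f2, f3 = q2_variants(y_train, y_ext, y_ext_pred)
print(f1, f2, f3)
```

The numerator is the same in all three variants; only the reference sum of squares changes, which is why the same predictions can yield different Q² values.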


Evaluation of model predictive ability by external validation techniques

Consonni

Ballabio

Todeschini

2010

Journal of Chemometrics

298

159


This paper deals with the problem of evaluating the predictive ability of regression models. In some cases, validation by internal cross-validation is not enough, and validation by an external test set has been suggested as an effective way of evaluating the model's predictive ability. Different functions for calculating the predictive squared correlation coefficient Q² from an external set have been proposed; they occasionally lead to different estimates of the model's predictive ability and therefore to contrasting decisions about model adequacy. In this paper, the advantages and drawbacks of these functions in estimating model predictive ability are discussed by comparing them on some simulated datasets.


Quantitative Structure–Activity Relationship Models for Ready Biodegradability of Chemicals

Mansouri

Ringsted

Ballabio

et al. 2013

J. Chem. Inf. Model.

174

126


The European REACH regulation requires information on ready biodegradation, which is a screening test to assess the biodegradability of chemicals. At the same time REACH encourages the use of alternatives to animal testing which includes predictions from quantitative structure-activity relationship (QSAR) models. The aim of this study was to build QSAR models to predict ready biodegradation of chemicals by using different modeling methods and types of molecular descriptors. Particular attention was given to data screening and validation procedures in order to build predictive models. Experimental values of 1055 chemicals were collected from the webpage of the National Institute of Technology and Evaluation of Japan (NITE): 837 and 218 molecules were used for calibration and testing purposes, respectively. In addition, models were further evaluated using an external validation set consisting of 670 molecules. Classification models were produced in order to discriminate biodegradable and nonbiodegradable chemicals by means of different mathematical methods: k nearest neighbors, partial least squares discriminant analysis, and support vector machines, as well as their consensus models. The proposed models and the derived consensus analysis demonstrated good classification performances with respect to already published QSAR models on biodegradation. Relationships between the molecular descriptors selected in each QSAR model and biodegradability were evaluated.
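A consensus of several classifiers can be sketched as a simple majority vote. The abstract does not state how the consensus models were built, so majority voting is an assumption here, as are the class labels "RB" and "NRB" (ready biodegradable and not ready biodegradable), the function name, and the example predictions.

```python
from collections import Counter


def majority_vote(model_predictions):
    """Combine per-model label lists into one consensus label per compound.

    Returns None for a compound when no label has a strict majority.
    """
    consensus = []
    for labels in zip(*model_predictions):
        label, count = Counter(labels).most_common(1)[0]
        consensus.append(label if count > len(labels) / 2 else None)
    return consensus


# Hypothetical predictions for three compounds from three models.
knn = ["RB", "NRB", "RB"]
plsda = ["RB", "RB", "NRB"]
svm = ["NRB", "NRB", "NRB"]

print(majority_vote([knn, plsda, svm]))
```

With three models and two classes a strict majority always exists; the None branch only matters for an even number of models or more than two classes.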


CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity

Mansouri

Kleinstreuer

Abdelaziz

et al. 2020

Environ Health Perspect

129

116


BACKGROUND: Endocrine disrupting chemicals (EDCs) are xenobiotics that mimic the interaction of natural hormones and alter synthesis, transport, or metabolic pathways. The prospect of EDCs causing adverse health effects in humans and wildlife has led to the development of scientific and regulatory approaches for evaluating bioactivity. This need is being addressed using high-throughput screening (HTS) in vitro approaches and computational modeling. OBJECTIVES: In support of the Endocrine Disruptor Screening Program, the U.S. Environmental Protection Agency (EPA) led two worldwide consortiums to virtually screen chemicals for their potential estrogenic and androgenic activities. Here, we describe the Collaborative Modeling Project for Androgen Receptor Activity (CoMPARA) effort, which follows the steps of the Collaborative Estrogen Receptor Activity Prediction Project (CERAPP).


Multivariate comparison of classification performance measures

Ballabio

Grisoni

Todeschini

2018

Chemometrics and Intelligent Laboratory Systems

210

113


Geographical origin and authentication of extra virgin olive oils by an electronic nose in combination with artificial neural networks

Cosio

Ballabio

Benedetti

et al. 2006

Analytica Chimica Acta

147




Davide Ballabio

Classification tools in chemistry. Part 1: linear models. PLS-DA

Comparison of Different Approaches to Define the Applicability Domain of QSAR Models

Comments on the Definition of the Q² Parameter for QSAR Validation

Evaluation of model predictive ability by external validation techniques

Quantitative Structure–Activity Relationship Models for Ready Biodegradability of Chemicals

CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity

Multivariate comparison of classification performance measures

Geographical origin and authentication of extra virgin olive oils by an electronic nose in combination with artificial neural networks
