Xavier Renard scite author profile

Post-hoc interpretability approaches have been proven to be powerful tools to generate explanations for the predictions made by a trained blackbox model. However, they create the risk of having explanations that are a result of some artifacts learned by the model instead of actual knowledge from the data. This paper focuses on the case of counterfactual explanations and asks whether the generated instances can be justified, i.e. continuously connected to some ground-truth data. We evaluate the risk of generating unjustified counterfactual examples by investigating the local neighborhoods of instances whose predictions are to be explained and show that this risk is quite high for several datasets. Furthermore, we show that most state of the art approaches do not differentiate justified from unjustified counterfactual examples, leading to less useful explanations.

show abstract

Comparison-Based Inverse Classification for Interpretability in Machine Learning

Laugel

Lesot

Marsala

et al. 2018

View full text Add to dashboard Cite

In the context of post-hoc interpretability, this paper addresses the task of explaining the prediction of a classifier, considering the case where no information is available, neither on the classifier itself, nor on the processed data (neither the training nor the test data). It proposes an instance-based approach whose principle consists in determining the minimal changes needed to alter a prediction: given a data point whose classification must be explained, the proposed method consists in identifying a close neighbour classified differently, where the closeness definition integrates a sparsity constraint. This principle is implemented using observation generation in the Growing Spheres algorithm. Experimental results on two datasets illustrate the relevance of the proposed approach that can be used to gain knowledge about the classifier.

show abstract

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Laugel¹,

Lesot²,

Marsala³

et al. 2019

Preprint

View full text Add to dashboard Cite

Unjustified Classification Regions and Counterfactual Explanations in Machine Learning

Laugel¹,

Lesot²,

Marsala³

et al. 2020

View full text Add to dashboard Cite

Defining Locality for Surrogates in Post-hoc Interpretablity

Laugel¹,

Renard²,

Lesot³

et al. 2018

Preprint

View full text Add to dashboard Cite

Random-shapelet: An algorithm for fast shapelet discovery

Renard

Rifqi

Erray

et al. 2015

View full text Add to dashboard Cite

Time series shapelets proposes an approach to extract subsequences most suitable to discriminate time series belonging to distinct classes.Computational complexity is the major issue with shapelets: the time required to identify interesting subsequences can be intractable for large cases. In fact, it is required to evaluate all the subsequences of all the time series of the training dataset. In the literature, improvements have been proposed to accelerate the process, but few provide a solution that dramatically reduces the time required to find a solution.We propose a random-based approach that reduces the time necessary to find a solution, in our experimentation until 3 orders of magnitude compared to the original method.Based on extensive experimentations on several data sets from the literature, we show that even with a few time available, random-shapelet algorithm is able to find very competitive shapelets.

show abstract

How to Choose an Explainability Method? Towards a Methodical Implementation of XAI in Practice

Vermeire

Laugel

Renard

et al. 2021

View full text Add to dashboard Cite

Concept Tree: High-Level Representation of Variables for More Interpretable Surrogate Decision Trees

Renard¹,

Woloszko²,

Aigrain³

et al. 2019

Preprint

View full text Add to dashboard Cite

Interpretable surrogates of black-box predictors trained on high-dimensional tabular datasets can struggle to generate comprehensible explanations in the presence of correlated variables. We propose a model-agnostic interpretable surrogate that provides global and local explanations of black-box classifiers to address this issue. We introduce the idea of concepts as intuitive groupings of variables that are either defined by a domain expert or automatically discovered using correlation coefficients. Concepts are embedded in a surrogate decision tree to enhance its comprehensibility. First experiments on FRED-MD, a macroeconomic database with 134 variables, show improvement in humaninterpretability while accuracy and fidelity of the surrogate model are preserved.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xavier Renard

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Comparison-Based Inverse Classification for Interpretability in Machine Learning

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Unjustified Classification Regions and Counterfactual Explanations in Machine Learning

Defining Locality for Surrogates in Post-hoc Interpretablity

Random-shapelet: An algorithm for fast shapelet discovery

How to Choose an Explainability Method? Towards a Methodical Implementation of XAI in Practice

Concept Tree: High-Level Representation of Variables for More Interpretable Surrogate Decision Trees

Contact Info

Product

Resources

About