Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence 2019
DOI: 10.24963/ijcai.2019/388

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Abstract: Post-hoc interpretability approaches have been proven to be powerful tools to generate explanations for the predictions made by a trained black-box model. However, they create the risk of having explanations that are a result of some artifacts learned by the model instead of actual knowledge from the data. This paper focuses on the case of counterfactual explanations and asks whether the generated instances can be justified, i.e. continuously connected to some ground-truth data. We evaluate the risk of generati…
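To make the notion of justification concrete, here is a minimal sketch of a connectivity test in the spirit of the paper's definition. It assumes a NumPy feature matrix, a `predict` callable, and an epsilon-neighbourhood graph as a stand-in for "continuously connected"; the function name, the `epsilon` parameter, and the graph construction are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def is_justified(counterfactual, X_train, predict, epsilon=0.5):
    """Sketch of a justification test: the counterfactual is 'justified'
    if an epsilon-chain (consecutive points closer than epsilon, all
    assigned the same class by the model) connects it to at least one
    training instance of the predicted class."""
    target = predict(counterfactual.reshape(1, -1))[0]
    # Restrict to training points the model assigns to the same class.
    same_class = X_train[predict(X_train) == target]
    if len(same_class) == 0:
        return False
    # Epsilon-neighbourhood graph over the counterfactual (node 0) and
    # the same-class training points, explored with a simple BFS.
    points = np.vstack([counterfactual.reshape(1, -1), same_class])
    dists = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    adjacency = dists <= epsilon
    visited, frontier = {0}, [0]
    while frontier:
        node = frontier.pop()
        for neighbour in np.flatnonzero(adjacency[node]):
            if neighbour not in visited:
                visited.add(int(neighbour))
                frontier.append(int(neighbour))
    # Justified if any training instance is reachable from node 0.
    return len(visited) > 1
```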

Citations: Cited by 121 publications (88 citation statements)
References: 0 publications
“…A similar limitation is observed due to the possibly high variability of the input. Laugel et al. raise the issue of justification for counterfactual explanations [144]. They argue that a synthesized counterfactual data point must be connected to the training data.…”
Section: Explainability Methods (mentioning)
Confidence: 99%
“…interpretable intermediate predictors) mimic the local neighbourhood (i.e., fidelity) and the data example to be explained (i.e., hit). Laugel et al. measure how justified counterfactuals are by averaging a binary score (one if the explanation is justified following the proposed definition, zero otherwise) over all the generated explanations [100], [144]. It is worth noting that the run-time of explanation generation algorithms is reported alongside the evaluation metrics for several frameworks [132], [139], [146], [152], [156], [159].…”
Section: Evaluation Methods (mentioning)
Confidence: 99%
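As a rough illustration of the averaged binary score described in the statement above, here is a minimal sketch reusing the hypothetical `is_justified` predicate from the earlier sketch; the helper name and signature are assumptions, not the cited papers' code.

```python
def justification_score(counterfactuals, X_train, predict, epsilon=0.5):
    """Fraction of generated counterfactual explanations that pass the
    binary justification test (1 if justified, 0 otherwise)."""
    flags = [is_justified(cf, X_train, predict, epsilon) for cf in counterfactuals]
    return sum(flags) / len(flags)
```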
“…The XAI methods introduced so far produce a posteriori explanations of deep learning models. Although such post hoc interpretations have been shown to be useful, some argue that, ideally, XAI methods should automatically offer human-interpretable explanations alongside their predictions [105]. Such approaches (herein referred to as 'self-explaining') would promote verification and error analysis, and be directly linkable with domain knowledge [106].…”
Section: Box 2 | XAI Applied to Cytochrome P450-Mediated Metabolism (mentioning)
Confidence: 99%