2021
DOI: 10.48550/arxiv.2101.06930
Preprint

Generative Counterfactuals for Neural Networks via Attribute-Informed Perturbation

Abstract: With the wide use of deep neural networks (DNNs), model interpretability has become a critical concern, since explainable decisions are preferred in high-stakes scenarios. Current interpretation techniques mainly focus on the feature-attribution perspective and are thus limited in indicating why and how particular explanations are related to the prediction. To this end, an intriguing class of explanations, named counterfactuals, has been developed to further explore the "what-if" circumstances for interpretation, …

Cited by 4 publications (4 citation statements: 0 supporting, 4 mentioning, 0 contrasting; all published in 2021)
References 29 publications (26 reference statements)
“…The first family of methods conditions the generative model on attributes, by e.g. using a conditional GAN [26,33,50,59,60]. This dependency on attribute information can restrict the applicability of these methods in scenarios where annotations are scarce.…”
Section: Related Work (mentioning)
confidence: 99%
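
The conditioning mechanism this statement describes can be illustrated with a minimal PyTorch sketch: the attribute vector is concatenated with the noise input, so a counterfactual candidate is obtained by regenerating the sample with one attribute flipped. All dimensions and names below are illustrative assumptions, not the architecture of any cited model.

import torch
import torch.nn as nn

class ConditionalGenerator(nn.Module):
    # Minimal sketch of a generator conditioned on a binary attribute vector.
    # Hypothetical dimensions; real conditional GANs use far richer architectures.
    def __init__(self, noise_dim=100, attr_dim=10, out_dim=784):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(noise_dim + attr_dim, 256),
            nn.ReLU(),
            nn.Linear(256, out_dim),
            nn.Tanh(),
        )

    def forward(self, z, attrs):
        # Conditioning: the attribute vector is concatenated with the noise,
        # so outputs can be steered by flipping individual attributes.
        return self.net(torch.cat([z, attrs], dim=1))

g = ConditionalGenerator()
z = torch.randn(1, 100)
attrs = torch.zeros(1, 10)
attrs[0, 3] = 1.0              # original attribute setting
x = g(z, attrs)                # factual sample
attrs_cf = attrs.clone()
attrs_cf[0, 3] = 0.0           # flip the attribute of interest
x_cf = g(z, attrs_cf)          # counterfactual candidate

This is also why the quoted statement notes the dependency on annotations: without labeled attributes, there is nothing to concatenate and flip.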
“…This is not a popular feature in the literature, likely because it is not an easy task to produce appropriate comparisons. Counterfactual approaches (such as [14,34,28]) are a particular case where the contrast is based on synthetic situations (values of features/attributes); another example is that of balanced explanations, which appear in [10].…”
Section: A Survey of Relevant XAI Approaches in the Literature (mentioning)
confidence: 99%
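
As a rough illustration of a contrast built from synthetic feature values, the sketch below perturbs a single feature of a tabular input until a classifier's prediction flips; the model.predict interface is an assumption for illustration, not taken from any cited work.

import numpy as np

def feature_flip_counterfactual(model, x, feature, candidate_values):
    # `model` is any object exposing predict(X) -> labels (assumed interface);
    # `x` is a 1-D numpy feature vector; `candidate_values` are synthetic
    # substitutes tried in place of x[feature].
    original = model.predict(x[None, :])[0]
    for v in candidate_values:
        x_cf = x.copy()
        x_cf[feature] = v
        if model.predict(x_cf[None, :])[0] != original:
            return x_cf, v   # contrastive "what-if" instance
    return None, None        # no flip found among the candidates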
“…These techniques are very related to the previously discussed perturbation-based mechanisms, but they come with a stronger guarantee, namely that the perturbations are guaranteed to change the model's prediction. The generation of counterfactual explanations has received significant attention in the NLP community [22,23,36,45,46]. In the simplest case, these counterfactuals can be generated by deleting words from the input text [23] or via rewrite-rules such as adding negations or shuffling words [45].…”
Section: Related Work (mentioning)
confidence: 99%
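
The simplest deletion-based strategy quoted above is easy to sketch: enumerate single-word deletions and keep those that change the classifier's label. The classify callable is an assumed interface; the rewrite-rule variants (negation insertion, word shuffling) would substitute a different perturbation function.

def deletion_counterfactuals(classify, text):
    # `classify` is a hypothetical callable mapping a string to a label.
    # Returns (deleted_word, counterfactual_text) pairs whose deletion
    # flips the predicted label.
    words = text.split()
    original = classify(text)
    flips = []
    for i in range(len(words)):
        candidate = " ".join(words[:i] + words[i + 1:])
        if classify(candidate) != original:
            flips.append((words[i], candidate))
    return flips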