The Use and Misuse of Counterfactuals in Ethical Machine Learning

Kasirzadeh, Atoosa; Smart, Andrew

doi:10.1145/3442188.3445886

Cited by 68 publications

(61 citation statements)

References 39 publications

“…Unfortunately, this fairness metric also comes with some difficulties in analyzing and evaluating the counterfactual statements. See Kasirzadeh and Smart [26] for some principled arguments against the prevalent use of counterfactual fairness in social contexts.…”

Section: Interpreting the Fairness Principle In Light Of Fairness Metrics (mentioning)

confidence: 99%

Fairness and Data Protection Impact Assessments

Kasirzadeh

Clifford

2021

Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society

Self Cite

In this paper, we critically examine the effectiveness of the requirement to conduct a Data Protection Impact Assessment (DPIA) in Article 35 of the General Data Protection Regulation (GDPR) in light of fairness metrics. Through this analysis, we explore the role of the fairness principle as introduced in Article 5(1)(a) and its multifaceted interpretation in the obligation to conduct a DPIA. Our paper argues that although there is a significant theoretical role for the considerations of fairness in the DPIA process, an analysis of the various guidance documents issued by data protection authorities on the obligation to conduct a DPIA reveals that they rarely mention the fairness principle in practice. Our analysis questions this omission, and assesses the capacity of fairness metrics to be truly operationalized within DPIAs. We conclude by exploring the practical effectiveness of DPIA with particular reference to (1) technical challenges that have an impact on the usefulness of DPIAs irrespective of a controller's willingness to actively engage in the process, (2) the context-dependent nature of the fairness principle, and (3) the key role played by data controllers in the determination of what is fair.

CCS Concepts: Computing methodologies → Artificial intelligence; Philosophical/theoretical foundations of artificial intelligence; Social and professional topics → Governmental regulations; Computer systems organization → Embedded systems.

“…There are also concerns about considering categories such as race or gender as a cause [30,26,15,22]. From one perspective, most of these attributes are determined at the time of an individual's conception and are modeled as source nodes in a causal graph which can directly or indirectly influence the descendent variables.…”

Section: Introduction (mentioning)

confidence: 99%

“…Alternatively, many view attributes such as race or gender as social constructs that evolve over the course an individual's life. Recently, [15,22] studied epistemological and ontological aspects of counterfactuals in the context of fairness evaluation. In [15], the authors argue that social categories such as race may not admit counterfactual manipulation.…”

Section: Introduction (mentioning)

confidence: 99%

Promises and Challenges of Causality for Ethical Machine Learning

Rahmattalabi

Xiang

2022

Preprint

In recent years, there has been increasing interest in using causal reasoning for designing fair decision-making systems due to its compatibility with legal frameworks, interpretability for human stakeholders, and robustness to spurious correlations inherent in observational data, among other factors. This recent attention to causal fairness, however, has been accompanied with great skepticism due to the practical and epistemological challenges with applying current causal fairness approaches proposed in the literature. Motivated by the long-standing empirical work on causality in econometrics, social sciences, and biomedical sciences, in this paper we lay out the conditions for appropriate application of causal fairness under the "potential outcomes framework." We highlight key aspects of causal inference that are often ignored in the causal fairness literature. In particular, we discuss the importance of specifying the nature and timing of proposed hypothetical interventions on social categories such as race or gender. Precisely, instead of postulating an intervention on immutable attributes, we propose a shift in focus to their perceptions and discuss the implications for fairness evaluation. We argue that such conceptualization of the hypothetical intervention is key in evaluating the validity of the causal assumptions and conducting sound causal analysis including avoiding post-treatment bias. Subsequently, we illustrate how causality can address the limitations of existing fairness metrics, including those that depend upon statistical correlations. Specifically, we introduce causal variants of common statistical notions of fairness, and we make a novel observation that under the causal framework there is no fundamental disagreement between different notions of fairness. Finally, we conduct extensive experiments where we demonstrate our approach for evaluating and mitigating unfairness, specially when post-treatment variables are present.

“…One suggested solution to address the issue of spurious features is counterfactually augmented data (CAD), instances generated by human annotators that are minimally edited to flip their label, and their variations such as iterative benchmark design (Potts et al, 2020), contrast data generation (Gardner et al, 2020), and their combination. Drawing on the rich history of counterfactuals (Pearl, 2018; Lewis, 2013; Kasirzadeh and Smart, 2021), the promise of CAD is to offer a causality-based framework where only cues that are meaningfully associated with the construct are edited, which is expected to be conducive to models learning less spurious features. Indeed, recent work has shown that models trained on CAD generalize better out of domain (Kaushik et al, 2020; Samory et al, 2021).…”

Section: Introduction (mentioning)

confidence: 99%

How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

Sen

Samory

Floeck

et al. 2021

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

As NLP models are increasingly deployed in socially situated settings such as online abusive content detection, it is crucial to ensure that these models are robust. One way of improving model robustness is to generate counterfactually augmented data (CAD) for training models that can better learn to distinguish between core features and data artifacts. While models trained on this type of data have shown promising out-of-domain generalizability, it is still unclear what the sources of such improvements are. We investigate the benefits of CAD for social NLP models by focusing on three social computing constructs: sentiment, sexism, and hate speech. Assessing the performance of models trained with and without CAD across different types of datasets, we find that while models trained on CAD show lower in-domain performance, they generalize better out-of-domain. We unpack this apparent discrepancy using machine explanations and find that CAD reduces model reliance on spurious features. Leveraging a novel typology of CAD to analyze their relationship with model performance, we find that CAD which acts on the construct directly or a diverse set of CAD leads to higher performance.
