2017
DOI: 10.1007/s10115-017-1116-3

Auditing black-box models for indirect influence

Abstract: Data-trained predictive models see widespread use, but for the most part they are used as black boxes which output a prediction or score. It is therefore hard to acquire a deeper understanding of model behavior, and in particular how different features influence the model prediction. This is important when interpreting the behavior of complex models, or asserting that certain problematic attributes (like race or gender) are not unduly influencing decisions. In this paper, we present a technique for aud…

Cited by 137 publications (46 citation statements)
References 30 publications
“…Some methods transform the constrained optimization problem via the method of Lagrange multipliers [3,15,34,37,38,59,97,135,183,185] or add penalties to the objective [5,14,22,44,46,56,58,62,73,74,75,79,82,87,88,89,93,94,95,98,104,115,119,123,133,134,135,138,139,154,165,176,179,180,182]; others use adversarial techniques to maximize the system's ability to predict the target while minimizing its ability to predict the sensitive attribute [189]. Post-processing methods consist of transforming the model outputs in order to make them fair [2,7,…”
Section: Methods For Imposing Fairness In a Model (mentioning)
confidence: 99%
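The penalty-based methods quoted above share one structure: minimize the task loss plus a weighted fairness term. The sketch below illustrates that structure only; the demographic-parity gap as the penalty and the weight `lam` are assumptions made for this example, not the formulation of any particular cited paper.

```python
import numpy as np

def fairness_penalized_loss(y_true, y_score, sensitive, lam=1.0):
    """Task loss plus a weighted fairness penalty (illustrative sketch).

    y_true    : binary labels, shape (n,)
    y_score   : model scores in (0, 1), shape (n,)
    sensitive : binary sensitive attribute, shape (n,)
    lam       : assumed trade-off weight between accuracy and fairness
    """
    eps = 1e-12
    # Standard log-loss for the prediction task.
    task = -np.mean(y_true * np.log(y_score + eps)
                    + (1 - y_true) * np.log(1 - y_score + eps))
    # Demographic-parity gap: difference in mean score between the two groups.
    gap = abs(y_score[sensitive == 1].mean() - y_score[sensitive == 0].mean())
    return task + lam * gap
```

Raising `lam` pushes the optimizer toward equal average scores across groups at some cost in task loss; the Lagrange-multiplier methods cited above instead treat the fairness term as a constraint and solve for the weight.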
“…Over the last few years, researchers have introduced a rich set of definitions formalizing different fairness desiderata that can be used for evaluating and designing ML systems [2,7,14,15,21,22,23,25,28,29,31,34,44,45,46,52,53,54,55,56,59,66,69,71,72,73,74,75,82,84,87,89,90,94,95,97,98,99,103,104,106,114,115,116,121,123,126,138,139,141,148,…”
Section: Causal Bayesian Network: An Essential Tool For Fairness (mentioning)
confidence: 99%
“…Another related post hoc technique called black box auditing (Adler et al, 2016) can be used to decide the extent to which a specific feature contributes to the accuracy (percentage of correct predictions) of a trained model. To quantify the direct effect of a feature, we can replace the feature by random noise and see how much the model accuracy drops.…”
Section: Explaining What Data Were Fed Into the Model (mentioning)
confidence: 99%
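The noise-substitution audit described in that excerpt is straightforward to sketch: leave the trained model untouched, replace one column of the test data with random noise, and measure the accuracy drop. Below is a minimal sketch; the synthetic dataset, the random-forest black box, and Gaussian noise are all assumptions made for illustration.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Illustrative data and black-box model; any fitted classifier would do.
X, y = make_classification(n_samples=2000, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X_train, y_train)

def direct_influence(model, X_test, y_test, col, rng):
    """Accuracy drop when feature `col` is replaced by random noise."""
    base = model.score(X_test, y_test)
    X_noised = X_test.copy()
    X_noised[:, col] = rng.normal(size=len(X_test))  # destroy the feature
    return base - model.score(X_noised, y_test)

rng = np.random.default_rng(0)
for col in range(X.shape[1]):
    drop = direct_influence(model, X_test, y_test, col, rng)
    print(f"feature {col}: accuracy drop {drop:+.3f}")
```

This measures only direct influence: a feature the model ignores but that is encoded in other features will show no drop. The indirect-influence auditing that gives the paper its title must additionally account for such proxy information carried by the remaining features.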
“…Such techniques use an interpretable model and apply it to the predictions returned by the black box (Hall, Phan, & Ambati, 2017). Currently many different explanation approaches exist, for example, providing logical statements (Lakkaraju, Bach, & Leskovec, 2016; Su, Wei, Varshney, & Malioutov, 2015; Wang et al, 2015; Wang & Rudin, 2014), local models (Rüping, 2005; Turner, 2016) or feature importance (Adler et al, 2018; Datta, Sen, & Zick, 2016; Goldstein, Kapelner, Bleich, & Pitkin, 2015; Puolamäki & Ukkonen, 2017).…”
Section: Theoretical Background (mentioning)
confidence: 99%
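The surrogate-model technique in the last excerpt, fitting an interpretable model to the predictions returned by the black box, can also be sketched briefly. The depth-3 decision tree as the surrogate and the synthetic black box are assumptions for the example, not any cited paper's exact method.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

# Illustrative black box, as in the previous sketch.
X, y = make_classification(n_samples=2000, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
black_box = RandomForestClassifier(random_state=0).fit(X_train, y_train)

# Global surrogate: fit an interpretable tree to the black box's predictions,
# not to the true labels, so the tree approximates the model, not the data.
surrogate = DecisionTreeClassifier(max_depth=3, random_state=0)
surrogate.fit(X_train, black_box.predict(X_train))

# Fidelity: how often the surrogate agrees with the black box on held-out data.
fidelity = (surrogate.predict(X_test) == black_box.predict(X_test)).mean()
print(f"surrogate fidelity: {fidelity:.2f}")
print(export_text(surrogate))
```

The fidelity score says how faithfully the printed tree mimics the black box; when fidelity is low, the extracted explanation should not be trusted.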