Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE), 2021
DOI: 10.1145/3468264.3468537

Bias in machine learning software: why? how? what to do?

Abstract: Increasingly, software is making autonomous decisions in areas such as criminal sentencing, approving credit cards, hiring employees, and so on. Some of these decisions show bias and adversely affect certain social groups (e.g. those defined by sex, race, age, marital status). Many prior works on bias mitigation take the following form: change the data or learners in multiple ways, then see if any of that improves fairness. Perhaps a better approach is to postulate root causes of bias and then apply some resoluti…

Cited by 103 publications (69 citation statements) | References 25 publications
“…ML software is developed following the data-driven programming paradigm. Therefore, data determine the decision logic of ML software to a large extent [17], and data bias is considered a main root cause of ML software bias [48]. Data testing aims to detect different types of data bias, including checking whether the labels of training data are biased (label bias) [35], whether the distribution of training data implies an unexpected correlation between the sensitive attribute and the outcome label (selection bias) [49], and whether the features of training data contain bias (feature bias) [50].…”
Section: Fairness Testing Components (mentioning; confidence: 99%)
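To make the data-testing idea above concrete, here is a minimal sketch of a selection-bias check that flags an unexpected correlation between a sensitive attribute and the outcome label. The column names ("sex", "label"), the use of Pearson correlation, and the 0.1 threshold are illustrative assumptions, not details from the cited works.

```python
# A minimal sketch of a data test for selection bias: flag an unexpected
# correlation between a sensitive attribute and the outcome label.
# Column names and the 0.1 threshold are illustrative assumptions.
import pandas as pd

def selection_bias_check(df: pd.DataFrame,
                         sensitive: str = "sex",
                         label: str = "label",
                         threshold: float = 0.1) -> bool:
    """Return True if |corr(sensitive, label)| exceeds the threshold."""
    corr = df[sensitive].corr(df[label])  # Pearson correlation
    return abs(corr) > threshold

# Toy example: the label closely tracks the sensitive attribute.
df = pd.DataFrame({"sex":   [0, 0, 0, 1, 1, 1],
                   "label": [0, 0, 1, 1, 1, 1]})
print(selection_bias_check(df))  # True -> possible selection bias
```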
“…For example, engineers cannot judge whether a system is fair to women if they are unaware of the outcomes that the system provides to men. In practice, metamorphic relations and statistical measurements are adopted to tackle the oracle problem of fairness testing [48], [55].…”
Section: Software Testing vs. Fairness Testing (mentioning; confidence: 99%)
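As one illustration of a metamorphic relation serving as a fairness-testing oracle, the sketch below flips only the sensitive attribute of each input and treats any change in the model's decision as a violation. The synthetic data, the logistic-regression model, and the position of the sensitive attribute are assumptions made for the example.

```python
# A minimal sketch of a metamorphic-relation oracle for fairness testing:
# flipping only the sensitive attribute of an input should not change the
# model's decision. Data, model, and attribute index are illustrative.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.random((200, 4))
X[:, 0] = rng.integers(0, 2, 200)   # column 0: binary sensitive attribute
y = rng.integers(0, 2, 200)
model = LogisticRegression().fit(X, y)

def violates_relation(model, x, sensitive_idx=0) -> bool:
    """True if flipping the sensitive attribute flips the prediction."""
    x_flipped = x.copy()
    x_flipped[sensitive_idx] = 1 - x_flipped[sensitive_idx]
    return model.predict([x])[0] != model.predict([x_flipped])[0]

violations = sum(violates_relation(model, x) for x in X)
print(f"{violations} of {len(X)} inputs violate the relation")
```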
“…Reject option classification [24] is a post-processing strategy that translates favorable outcomes from the privileged group to the unprivileged group and unfavorable outcomes from the unprivileged group to the privileged group, based upon a certain level of confidence and uncertainty. Chakraborty et al. proposed Fair-SMOTE [25], a pre-processing and in-processing approach, which balances class and label distributions and performs situation testing (i.e., testing individual fairness through alternate "worlds").…”
Section: Related Work (mentioning; confidence: 99%)
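A minimal sketch of the reject option classification idea as described above, assuming we already have predicted probabilities of the favorable outcome, a boolean mask marking the privileged group, and an illustrative uncertainty band of width 0.1 around the decision boundary:

```python
# A minimal sketch of reject option classification: inside a low-confidence
# band around the decision boundary, favorable outcomes go to the
# unprivileged group and unfavorable ones to the privileged group.
# The band width (theta) and group encoding are illustrative assumptions.
import numpy as np

def reject_option_classify(proba_favorable: np.ndarray,
                           privileged: np.ndarray,
                           theta: float = 0.1) -> np.ndarray:
    """proba_favorable: P(favorable) per instance; privileged: boolean mask."""
    preds = (proba_favorable >= 0.5).astype(int)        # default decision
    uncertain = np.abs(proba_favorable - 0.5) <= theta  # reject-option band
    preds[uncertain & ~privileged] = 1  # favorable for unprivileged
    preds[uncertain & privileged] = 0   # unfavorable for privileged
    return preds

proba = np.array([0.45, 0.55, 0.90, 0.52])
priv = np.array([True, True, False, False])
print(reject_option_classify(proba, priv))  # [0 0 1 1]
```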