2021
DOI: 10.48550/arxiv.2110.01109
Preprint

xFAIR: Better Fairness via Model-based Rebalancing of Protected Attributes

Abstract: Machine learning software can generate models that inappropriately discriminate against specific protected social groups (e.g., groups based on gender, ethnicity, etc.). Motivated by those results, software engineering researchers have proposed many methods for mitigating those discriminatory effects. While those methods are effective in mitigating bias, few of them can explain what causes the bias in the first place. Here we propose xFAIR, a model-based extrapolation method that is capable of both mitigating…

Cited by 3 publications (4 citation statements)
References 22 publications
“…Peng et al. [144] used logistic regression and decision tree algorithms as models to extrapolate the correlations among dependent variables that might cause bias in training data.…”
Section: Data Testing (mentioning, confidence: 99%)
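The statement above summarizes the mechanism only at a high level. As a rough sketch of that model-based extrapolation idea (not the authors' actual implementation), the snippet below fits a surrogate decision tree that predicts the protected attribute from the remaining features; whichever features the surrogate leans on are the ones correlated with, and able to leak, that attribute. The DataFrame `df`, the column name "sex", and the model settings are illustrative assumptions.

```python
# Sketch only: a surrogate model reveals which features are correlated
# with a protected attribute. Assumes a pandas DataFrame whose features
# are already numerically encoded; column names are hypothetical.
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

def protected_attribute_signal(df: pd.DataFrame, protected: str) -> pd.Series:
    X = df.drop(columns=[protected])
    y = df[protected]
    # The surrogate tree tries to reconstruct the protected attribute
    # from everything else, as in the extrapolation step described above.
    surrogate = DecisionTreeClassifier(max_depth=5, random_state=0).fit(X, y)
    # High-importance features carry (and can leak) the protected signal.
    return (pd.Series(surrogate.feature_importances_, index=X.columns)
              .sort_values(ascending=False))

# Hypothetical usage:
#   df = pd.read_csv("adult.csv")   # tabular data, numerically encoded
#   print(protected_attribute_signal(df, protected="sex"))
```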
“…Tool [Ref] | Usage scenario | Description | Link
FairTest [140] | General ML software | Analyzing associations between software outcomes and sensitive attributes | [196]
Themis [55] | General ML software | Black-box random discriminatory instance generation | [197]
Aequitas [78] | General ML software | Black-box search-based discriminatory instance generation | [198]
ExpGA [77] | General ML software | Black-box search-based discriminatory instance generation | [199]
fairCheck [89] | General ML software | Verification-based discriminatory instance generation | [200]
MLCheck [88] | General ML software | Verification-based discriminatory instance generation | [201]
LTDD [50] | General ML software | Detecting which data features and which parts of them are biased | [202]
Fair-SMOTE [48] | General ML software | Detecting biased data labels and data distributions | [203]
xFAIR [144] | General ML software | Extrapolation of correlations among data features that might cause bias | [204]
Fairway [35] | General ML software | Detecting biased data labels and optimal hyper-parameters for ML fairness | [205]
Parfait-ML [46] | General ML software | Searching for hyper-parameters optimal to ML software fairness | [206]
Fairea [38] | General ML software | Testing fairness repair algorithms | [207]
IBM AIF360 [161] | General ML software | Examining and mitigating discrimination and bias in ML software | [119]
scikit-fairness [208] | General ML software | Examining and mitigating discrimination and bias in ML software | [208]
LiFT [209] | General ML software | Examining and mitigating discrimination and bias in ML software | [210]
SageMaker Clarify [211] | General ML software | Measuring bias that occurs in each stage of the ML life cycle | [212]
FairVis [213] | General ML software | Visual analytics for discovering intersectional bias in ML software | [214]
FairRepair [155] | Tree-based classifiers | Detecting paths responsible for unfairness in tree-based classifiers | [215]
ADF…”
Section: Tool [Ref] (mentioning, confidence: 99%)
“…The core idea of the above approach is to analyze and mitigate the bias defined by the protected subgroup of these sensitive features to estimate the direct discrimination. Similarly, other previous studies have mainly relied upon simple statistical analysis involving association or correlation measures [16, 48, 70, 51, 60]. However, such analyses can lead to incorrect conclusions because they largely ignore the effect of confounding variables, i.e., variables that can be used to determine both the outcome and the feature pairs.…”
Section: Bias and Fairness (mentioning, confidence: 99%)
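To make the confounding point concrete, here is a small synthetic illustration (our own construction, not data from the cited work): a confounder Z drives both the protected attribute A and the outcome Y, so a naive correlation between A and Y looks like direct discrimination even though A has no direct effect on Y.

```python
# Synthetic illustration of confounding, assumed for this page (not from
# the cited study). Z causes both A and Y; A does not cause Y.
import numpy as np

rng = np.random.default_rng(0)
n = 100_000
z = rng.normal(size=n)                           # confounder
a = (z + rng.normal(size=n) > 0).astype(float)   # protected attribute, driven by z
y = z + rng.normal(size=n)                       # outcome, driven only by z

print(f"corr(A, Y), ignoring Z:  {np.corrcoef(a, y)[0, 1]:.3f}")  # clearly nonzero

# Conditioning on the confounder (a thin slice around z = 0) removes
# most of the association, exposing it as spurious.
mask = np.abs(z) < 0.1
print(f"corr(A, Y), given z ~ 0: {np.corrcoef(a[mask], y[mask])[0, 1]:.3f}")  # near zero
```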
“…On the other hand, several techniques focus on modifying the dataset according to protected data in order to mitigate bias, mainly based on rebalancing techniques [11], [15]. [9] tries to identify bias in the labels and proposes a method based on re-weighting the elements in the dataset to mitigate such bias.…”
Section: Related Work (mentioning, confidence: 99%)
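The re-weighting idea mentioned in that statement can be sketched as follows, in the style of Kamiran and Calders' classic reweighing scheme; this is a generic sketch under that assumption, not the specific method of reference [9].

```python
# Generic reweighing sketch (Kamiran & Calders style), assumed here as an
# illustration; not the exact method of reference [9].
import numpy as np

def reweigh(a: np.ndarray, y: np.ndarray) -> np.ndarray:
    """a: binary protected attribute, y: binary label. Returns one weight
    per instance, P(A=a) * P(Y=y) / P(A=a, Y=y), which makes the protected
    attribute statistically independent of the label in the weighted data."""
    w = np.empty(len(y))
    for g in (0, 1):
        for lbl in (0, 1):
            cell = (a == g) & (y == lbl)
            expected = (a == g).mean() * (y == lbl).mean()
            observed = cell.mean()
            w[cell] = expected / observed if observed > 0 else 0.0
    return w

# Usage: pass the weights when fitting any classifier that accepts them, e.g.
#   LogisticRegression().fit(X, y, sample_weight=reweigh(a, y))
```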