2022
DOI: 10.1002/widm.1452
|View full text |Cite
|
Sign up to set email alerts
|

A survey on datasets for fairness‐aware machine learning

Abstract: As decision‐making increasingly relies on machine learning (ML) and (big) data, the issue of fairness in data‐driven artificial intelligence systems is receiving increasing attention from both research and industry. A large variety of fairness‐aware ML solutions have been proposed which involve fairness‐related interventions in the data, learning algorithms, and/or model outputs. However, a vital part of proposing new approaches is evaluating them empirically on benchmark datasets that represent realistic and … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
26
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
2

Relationship

1
9

Authors

Journals

citations
Cited by 81 publications
(44 citation statements)
references
References 95 publications
(113 reference statements)
0
26
0
Order By: Relevance
“…COMPAS 3 : This dataset describes the task of predicting the recidivism of individuals in the U.S. Both “sex” and “race” could be the sensitive attributes of this dataset (Le Quy et al, 2022 ).…”
Section: Methodsmentioning
confidence: 99%
“…COMPAS 3 : This dataset describes the task of predicting the recidivism of individuals in the U.S. Both “sex” and “race” could be the sensitive attributes of this dataset (Le Quy et al, 2022 ).…”
Section: Methodsmentioning
confidence: 99%
“…In this study, we focus on tabular datasets, mostly used in fairness-aware machine learning research (Le Quy et al, 2022). We use datasets from the financial (Adult Income dataset), criminological (Catalonia Juvenile dataset, Crimes and Communities dataset) and the educational (Student performance dataset, Law admission dataset) domain.…”
Section: Methodsmentioning
confidence: 99%
“…Classification tasks on tabular data have been widely studied in the machine learning literature [23], [24]. One popular approach for such tasks is the use of decision tree-based algorithms, such as Random Forests [25] and XGBoost [26].…”
Section: Related Workmentioning
confidence: 99%