Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion 2022
DOI: 10.18653/v1/2022.ltedi-1.1

Mind the data gap(s): Investigating power in speech and language datasets

Nina Markl

Abstract: Algorithmic oppression is an urgent and persistent problem in speech and language technologies. Considering power relations embedded in datasets before compiling or using them to train or test speech and language technologies is essential to designing less harmful, more just technologies. This paper presents a reflective exercise to recognise and challenge gaps and the power relations they reveal in speech and language datasets by applying principles of Data Feminism and Design Justice, and building on work on…

Citations: cited by 2 publications (1 citation statement)
References: 27 publications
“…A comprehensive discussion of stakeholders, emphasizing their relative power, is crucial for understanding the associated risks. As various researchers have articulated, it is essential to underscore power inequities by considering what might be absent from a dataset [62,102]. We build upon this observation, and various other insights on the relations between power structures and socio-technical algorithmic systems [21,84,86], structuring our analysis around the inclusion or exclusion of various groups in the development and deployment of these models.…”
Section: Stakeholders and Power Dynamics (mentioning)
confidence: 99%