Language variation and algorithmic bias: understanding algorithmic bias in British English automatic speech recognition

Markl, Nina

doi:10.1145/3531146.3533117

Cited by 21 publications

(10 citation statements)

References 87 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Factors relating to a speaker's linguistic background, such as accent, can prove challenging for an automatic transcription system. Previous work has demonstrated that the performance of ASR systems declines significantly when confronted with speech that diverges from the "standard" variety; this has been found for non-native-accented speech in English (Meyer et al, 2020;DiChristofano et al, 2022;Markl, 2022) and Dutch (Feng et al, 2021), as well as for non-standard regionallyaccented speech in Brazilian Portuguese (Lima et al, 2019) and British English (Markl, 2022). Markl (2022) compared the performance of Google and Amazon transcription services across multiple accents of British English.…”

Section: Automatic Systems and Speaker Factorsmentioning

confidence: 99%

“…Many researchers have suggested that the composition of training datasets can cause bias within automatic systems (Tatman, 2017;Koenecke et al, 2020;Meyer et al, 2020;Feng et al, 2021) and that the underrepresentation of certain accents leads to a decline in performance for those varieties. Markl (2022) reports that certain substitution errors identified for speakers of nonstandard regional accents of British English suggest that there is an overrepresentation of Southern accents in the training data or that acoustic models are being trained only on more prestigious Southern varieties, such as Received Pronunciation. Similarly, Wassink et al (2022) claim that 20% of the errors within their data would be addressed by incorporating dialectal forms of ethnic varieties of American English (African American, ChicanX, and Native American) into the training of the automatic systems.…”

Section: Automatic Systems and Speaker Factorsmentioning

confidence: 99%

“…In recent years, a growing body of research has focused on systematic bias within automatic systems, i.e., underperformance for certain demographic groups, and significant disparities in performance have been demonstrated across accents. Transcripts tend to be significantly less accurate for non-native speakers (DiChristofano et al, 2022) or speakers of nonstandard regional varieties (Markl, 2022). However, a limitation of work in this area is the use of word error rate (WER) for evaluating performance.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Incorporating automatic speech recognition methods into the transcription of police-suspect interviews: factors affecting automatic performance

Harrington

2023

Front. Commun.

View full text Add to dashboard Cite

IntroductionIn England and Wales, transcripts of police-suspect interviews are often admitted as evidence in courts of law. Orthographic transcription is a time-consuming process and is usually carried out by untrained transcribers, resulting in records that contain summaries of large sections of the interview and paraphrased speech. The omission or inaccurate representation of important speech content could have serious consequences in a court of law. It is therefore clear that investigation into better solutions for police-interview transcription is required. This paper explores the possibility of incorporating automatic speech recognition (ASR) methods into the transcription process, with the goal of producing verbatim transcripts without sacrificing police time and money. We consider the potential viability of automatic transcripts as a “first” draft that would be manually corrected by police transcribers. The study additionally investigates the effects of audio quality, regional accent, and the ASR system used, as well as the types and magnitude of errors produced and their implications in the context of police-suspect interview transcripts.MethodsSpeech data was extracted from two forensically-relevant corpora, with speakers of two accents of British English: Standard Southern British English and West Yorkshire English (a non-standard regional variety). Both a high quality and degraded version of each file was transcribed using three commercially available ASR systems: Amazon, Google, and Rev.ResultsSystem performance varied depending on the ASR system and the audio quality, and while regional accent was not found to significantly predict word error rate, the distribution of errors varied substantially across the accents, with more potentially damaging errors produced for speakers of West Yorkshire English.DiscussionThe low word error rates and easily identifiable errors produced by Amazon suggest that the incorporation of ASR into the transcription of police-suspect interviews could be viable, though more work is required to investigate the effects of other contextual factors, such as multiple speakers and different types of background noise.

show abstract

Section: Automatic Systems and Speaker Factorsmentioning

confidence: 99%

Section: Automatic Systems and Speaker Factorsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Incorporating automatic speech recognition methods into the transcription of police-suspect interviews: factors affecting automatic performance

Harrington

2023

Front. Commun.

View full text Add to dashboard Cite

show abstract

“…how do you model a heterogeneous everchanging group of people (2) the problem addressed a responsible AI value in their team (3) the problem appears to be caused by a realistic data ethics issue i.e. resulting from data or an algorithm using data (4) prompts are in the form of written information from people within the company (5) prompts increase in severity or difficulty over time (6) prompts have clear directives on what to discuss (7) prompts involve a variety of stakeholders This example is inspired by recent research that has found bias against second language speakers of English [38] in automatic speech recognition systems (ASR).…”

Section: Illustrative Examplementioning

confidence: 99%

Data Ethics Emergency Drill: A Toolbox for Discussing Responsible AI for Industry Teams

Hanschke,

Rees,

Alanyali

et al. 2024

Proceedings of the CHI Conference on Human Factors in Computing Systems

View full text Add to dashboard Cite

show abstract

“…Accommodating technology shortcomings limits the potential benefits of technologies. However, when technologies with performance disparities are used in consequential domains -such as in job application videos -degraded service can not only stigmatize users but also lead to other types of harm, such as allocative harms [121].…”

Section: Service or Benefit Lossmentioning

confidence: 99%

Identifying Sociotechnical Harms of Algorithmic Systems: Scoping a Taxonomy for Harm Reduction

Shelby¹,

Rismani²,

Henne³

et al. 2022

Preprint

View full text Add to dashboard Cite

Understanding the landscape of potential harms from algorithmic systems enables practitioners to better anticipate consequences of the systems they build. It also supports the prospect of incorporating controls to help minimize harms that emerge from the interplay of technologies and social and cultural dynamics. A growing body of scholarship has identified a wide range of harms across different algorithmic technologies. However, computing research and practitioners lack a high level and synthesized overview of harms from algorithmic systems arising at the micro-, meso-, and macro-levels of society. We present an applied taxonomy of sociotechnical harms to support more systematic surfacing of potential harms in algorithmic systems. Based on a scoping review of computing research (n=172), we identified five major themes related to sociotechnical harms -representational, allocative, quality-of-service, interpersonal harms, and social system/societal harms -and sub-themes. We describe these categories and conclude with a discussion of challenges and opportunities for future research.

show abstract

Language variation and algorithmic bias: understanding algorithmic bias in British English automatic speech recognition

Cited by 21 publications

References 87 publications

Incorporating automatic speech recognition methods into the transcription of police-suspect interviews: factors affecting automatic performance

Incorporating automatic speech recognition methods into the transcription of police-suspect interviews: factors affecting automatic performance

Data Ethics Emergency Drill: A Toolbox for Discussing Responsible AI for Industry Teams

Identifying Sociotechnical Harms of Algorithmic Systems: Scoping a Taxonomy for Harm Reduction

Contact Info

Product

Resources

About