2022
DOI: 10.1111/lnc3.12474
|View full text |Cite
|
Sign up to set email alerts
|

Computational sociophonetics using automatic speech recognition

Abstract: Recent years have seen numerous advances in natural language processing that can help accelerate sociophonetic work. These include software to align speech recordings with their transcriptions, as well as to transcribe audio automatically. This solves a major bottleneck and will help process larger datasets and test hypotheses more efficiently. This paper will summarise recent progress, highlight relevant examples of sociophonetic research, and comment on the technical and ethical issues at the cutting edge of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 118 publications
(127 reference statements)
0
1
0
Order By: Relevance
“…Recent studies explored fine-tuning of pretrained self-supervised models for ASR using speech from low-resource languages (e.g., Coto-Solano et al 2022;Guillaume et al 2022), and difficulties of modeling resource-scarce languages and dialects were acknowledged in previous work (Aksënova et al, 2022). It remains an open question to what extent model performance is dependent on the amount of fine-tuning data and the type of language, when the total amount of available data for a language is limited.…”
Section: Introductionmentioning
confidence: 99%
“…Recent studies explored fine-tuning of pretrained self-supervised models for ASR using speech from low-resource languages (e.g., Coto-Solano et al 2022;Guillaume et al 2022), and difficulties of modeling resource-scarce languages and dialects were acknowledged in previous work (Aksënova et al, 2022). It remains an open question to what extent model performance is dependent on the amount of fine-tuning data and the type of language, when the total amount of available data for a language is limited.…”
Section: Introductionmentioning
confidence: 99%