2021
DOI: 10.1016/j.eswa.2021.115740

Towards better subtitles: A multilingual approach for punctuation restoration of speech transcripts

Cited by 7 publications
(3 citation statements)
References 48 publications
“…Previous work on multilingual punctuation prediction is described in Li and Lin (2020) and Guerreiro, Rei, and Batista (2021). Vandeghinste, Verwimp, Pelemans, and Wambacq (2018) models punctuation prediction in the context of speech translation, but also investigates a monolingual approach for Dutch, modelling punctuation prediction as a machine translation problem, in which the source language is the text without punctuation, and the target language is the text with punctuation.…”
Section: The Belgian Federal Covid-19 Sign Language (Becos) Corpus
Citation type: mentioning
confidence: 99%
“…Previous work on multilingual punctuation prediction is described in Li and Lin (2020) and Guerreiro et al (2021). Vandeghinste et al (2018) models punctuation prediction in the context of speech translation, but also investigates a monolingual approach for Dutch, modelling punctuation prediction as a machine translation problem, in which the source language is the text without punctuation, and the target language is the text with punctuation.…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…Text post-processing after ASR, e.g. punctuation and capitalization (Guerreiro et al, 2021) and inverse text normalization (Sunkara et al, 2021), is another important problem for multilingual and CS speech systems. Such post-processing is harder than the monolingual scenario as it requires accurate language identification in addition to transcript generation.…”
Section: Introduction
Citation type: mentioning
confidence: 99%