Tight Integrated End-to-End Training for Cascaded Speech Translation

Bahar, Parnia; Bieschke, Tobias; Schlüter, Ralf; Ney, Hermann

doi:10.48550/arxiv.2011.12167

Cited by 1 publication

(1 citation statement)

References 27 publications

(44 reference statements)

“…Unlike consecutive translation, where the translation is done after the speaker pauses, in SI the translation process starts while the speaker is still talking. With recent developments in machine translation and speech processing, various studies have been conducted aiming at automatic speech translation (Inaguma et al, 2021; Bahar et al, 2021), including SI (Oda et al, 2014; Zheng et al, 2019; Arivazhagan et al, 2019; Zhang et al, 2020; Nguyen et al, 2021), based on speech corpora.…”

Section: Introduction (mentioning)

confidence: 99%

Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data

Doi, Sudoh, Nakamura

2021

Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)

This paper describes the construction of a new large-scale English-Japanese Simultaneous Interpretation (SI) corpus and presents the results of its analysis. A portion of the corpus contains SI data from three interpreters with different amounts of experience. Some of the SI data were manually aligned with the source speeches at the sentence level. Their latency, quality, and word order aspects were compared among the SI data themselves as well as against offline translations. The results showed that (1) interpreters with more experience controlled the latency and quality better, and (2) large latency hurt the SI quality.
