The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task

Xu, Chen; Liu, Xiaoqian; Liu, Xiaowen; Wang, Tiger; Huang, Canan; Xiao, Tong; Zhu, Jun

doi:10.18653/v1/2021.iwslt-1.9

Cited by 2 publications

(3 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Two main word detection strategies are currently employed by the community: fixed (Ma et al, 2020b), and adaptive (Ma et al, 2020b;Ren et al, 2020;Zeng et al, 2021;Chen et al, 2021). The fixed word detection strategy represents the easiest way to address the problem since it assumes that a fixed amount of time is required to pronounce every word, disregarding the information contained in the audio.…”

Section: Word Detection For Wait-kmentioning

confidence: 99%

“…On the contrary, the adaptive word detection strategy determines the number of words by looking at the content of the audio. The decision about waiting or emitting can be taken through an Automatic Speech Recognition decoder (Chen et al, 2021) 2 or through a Connectionist Temporal Classification (Graves et al, 2006) -or CTC -module (Ren et al, 2020;Zeng et al, 2021), responsible for directly detecting the number of words every time a speech chunk is received by the system.…”

Section: Word Detection For Wait-kmentioning

confidence: 99%

“…The most popular decision policy is the wait-k, a straightforward heuristic that prescribes to wait for a predefined number of words before starting to generate the translation. Initially proposed by Ma et al (2020b) for simultaneous machine translation (SimulMT), the wait-k has been widely adopted in SimulST (Ma et al, 2020b;Ren et al, 2020;Chen et al, 2021;Zeng et al, 2021; thanks to its simplicity. Apart from wait-k, other attempts have been made to develop decision policies learned by the SimulST system itself (Ma et al, 2019b;Zaidi et al, 2021;Liu et al, 2021a,b), all resulting in computationally expensive models with limited diffusion.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Does Simultaneous Speech Translation need Simultaneous Models?

Papi¹,

Gaido²,

Negri³

et al. 2022

Preprint

View full text Add to dashboard Cite

In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task. To meet the latency constraints posed by the different application scenarios, multiple dedicated SimulST models are usually trained and maintained, generating high computational costs. In this paper, motivated by the increased social and environmental impact caused by these costs, we investigate whether a single model trained offline can serve not only the offline but also the simultaneous task without the need for any additional training or adaptation. Experiments on en→{de, es} indicate that, aside from facilitating the adoption of well-established offline techniques and architectures without affecting latency, the offline solution achieves similar or better translation quality compared to the same model trained in simultaneous settings, as well as being competitive with the SimulST state of the art.

show abstract

Section: Word Detection For Wait-kmentioning

confidence: 99%

Section: Word Detection For Wait-kmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Does Simultaneous Speech Translation need Simultaneous Models?

Papi¹,

Gaido²,

Negri³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

Findings of the Iwslt 2021 Evaluation Campaign

Anastasopoulos

Bojar

Bremerman³

et al. 2021

Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)

View full text Add to dashboard Cite

The evaluation campaign of the International Conference on Spoken Language Translation (IWSLT 2021) featured this year four shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Multilingual speech translation, (iv) Low-resource speech translation. A total of 22 teams participated in at least one of the tasks. This paper describes each shared task, data and evaluation metrics, and reports results of the received submissions.

show abstract

The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task

Cited by 2 publications

References 24 publications

Does Simultaneous Speech Translation need Simultaneous Models?

Does Simultaneous Speech Translation need Simultaneous Models?

Findings of the Iwslt 2021 Evaluation Campaign

Contact Info

Product

Resources

About