Interspeech 2023
DOI: 10.21437/interspeech.2023-2225

Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff

Abstract: Blockwise self-attentional encoder models have recently emerged as one promising end-to-end approach to simultaneous speech translation. These models employ a blockwise beam search with hypothesis reliability scoring to determine when to wait for more input speech before translating further. However, this method maintains multiple hypotheses until the entire speech input is consumed, so it cannot directly show a single incremental translation to users. Further, this method lacks mechanisms for controllin…

Cited by 1 publication (1 citation statement)
References 32 publications
“…Therefore, we proposed an improved version of the AL metric, which was later independently proposed under name length-adaptive average lagging (LAAL; Papi et al, 2022). To remedy the over-generation problem, we proposed an improved version of the beam search algorithm in Polák et al (2023b). While this led to significant improvements in the quality-latency tradeoff, the decoding still relied on label-synchronous decoding.…”
Section: Quality-latency Tradeoff in SST (mentioning)
confidence: 99%