ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/icassp40776.2020.9054759
End-end Speech-to-Text Translation with Modality Agnostic Meta-Learning

Abstract: End-to-end Speech Translation (ST) models have several advantages over conventional pipelines that combine Automatic Speech Recognition (ASR) and text Machine Translation (MT) models, such as lower latency, smaller model size, and less error compounding. However, collecting large amounts of parallel data for the ST task is more difficult than for the ASR and MT tasks. Previous studies have proposed transfer learning approaches to overcome this difficulty. These approaches benefit from weakly supe…

Cited by 39 publications (44 citation statements)
References 21 publications (20 reference statements)
“…This is the only submission to use an end-to-end approach for the speech track. The authors use transformer-based models combining the wait-k strategy with a modality-agnostic meta-learning approach (Indurthi et al., 2020) to address data sparsity. They also use the ST task along with ASR and MT as a source task, a minor variation compared to the original paper.…”
Section: Submissions
confidence: 99%
“…BHANSS (Lakumarapu et al., 2020) built their end-to-end system adopting the Transformer architecture (Vaswani et al., 2017a) coupled with the meta-learning approach proposed by Indurthi et al. (2020). Meta-learning is used to mitigate the issue of over-fitting when the training data is limited, as in the ST case, and allows their system to take advantage of the available ASR and MT data.…”
Section: Submissions
confidence: 99%
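The meta-learning recipe described in these citation statements — a shared initialization adapted per source task, with ASR and MT data serving as source tasks alongside ST — can be sketched in a first-order MAML style. The toy regression tasks, task names, loss, and step sizes below are illustrative assumptions, not the actual setup of Indurthi et al. (2020):

```python
# Minimal first-order MAML-style sketch of modality-agnostic meta-learning.
# Each "source task" (ASR, MT, ST) is reduced to a toy scalar regression
# problem here; in the paper the tasks share one sequence-to-sequence model.
import numpy as np

rng = np.random.default_rng(0)

def make_task(slope):
    """Toy task: noiseless data from y = slope * x."""
    x = rng.normal(size=8)
    return x, slope * x

def loss_and_grad(w, x, y):
    """Mean squared error and its gradient w.r.t. the scalar weight w."""
    err = w * x - y
    return np.mean(err ** 2), np.mean(2.0 * err * x)

# Stand-ins for the three source tasks (slopes are arbitrary).
tasks = {"ASR": 1.0, "MT": 2.0, "ST": 1.5}

w = 0.0                  # shared meta-parameters (initialization)
alpha, beta = 0.1, 0.05  # inner (task) / outer (meta) learning rates

for _ in range(200):
    meta_grad = 0.0
    for slope in tasks.values():
        x, y = make_task(slope)
        # Inner step: adapt to this task from the shared initialization.
        _, g = loss_and_grad(w, x, y)
        w_task = w - alpha * g
        # Outer step (first-order approximation): accumulate the gradient
        # evaluated at the task-adapted parameters.
        _, g_adapt = loss_and_grad(w_task, x, y)
        meta_grad += g_adapt
    w -= beta * meta_grad / len(tasks)

# The meta-learned initialization settles between the task optima,
# so a few inner steps adapt it quickly to any one task.
print(w)
```

The point of the sketch is the two-level loop: inner updates specialize to each source task, while the outer update moves the shared initialization toward parameters that adapt well to all of them — which is how ASR and MT data can help a low-resource ST model.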