Jindřich Helcl scite author profile

Modeling attention in neural multi-source sequence-to-sequence learning remains a relatively unexplored area, despite its usefulness in tasks that incorporate multiple source languages or modalities. We propose two novel approaches to combine the outputs of attention mechanisms over each source sequence, flat and hierarchical. We compare the proposed methods with existing techniques and present results of systematic evaluation of those methods on the WMT16 Multimodal Translation and Automatic Post-editing tasks. We show that the proposed methods achieve competitive results on both tasks.

show abstract

End-to-End Non-Autoregressive Neural Machine Translation with Connectionist Temporal Classification

Libovický¹,

Helcl²

2018

116

108

View full text Add to dashboard Cite

Autoregressive decoding is the only part of sequence-to-sequence models that prevents them from massive parallelization at inference time. Non-autoregressive models enable the decoder to generate all output symbols independently in parallel. We present a novel nonautoregressive architecture based on connectionist temporal classification and evaluate it on the task of neural machine translation. Unlike other non-autoregressive methods which operate in several steps, our model can be trained end-to-end. We conduct experiments on the WMT English-Romanian and English-German datasets. Our models achieve a significant speedup over the autoregressive models, keeping the translation quality comparable to other non-autoregressive models.

show abstract

CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks

Libovický¹,

Helcl²,

Tlustý³

et al. 2016

View full text Add to dashboard Cite

Neural sequence to sequence learning recently became a very promising paradigm in machine translation, achieving competitive results with statistical phrase-based systems. In this system description paper, we attempt to utilize several recently published methods used for neural sequential learning in order to build systems for WMT 2016 shared tasks of Automatic Post-Editing and Multimodal Machine Translation.

show abstract

CUNI System for the WMT18 Multimodal Translation Task

Helcl¹,

Libovický²,

Variš³

2018

View full text Add to dashboard Cite

We present our submission to the WMT18 Multimodal Translation Task. The main feature of our submission is applying a selfattentive network instead of a recurrent neural network. We evaluate two methods of incorporating the visual features in the model: first, we include the image representation as another input to the network; second, we train the model to predict the visual features and use it as an auxiliary objective. For our submission, we acquired both textual and multimodal additional data. Both of the proposed methods yield significant improvements over recurrent networks and self-attentive textual baselines.

show abstract

Input Combination Strategies for Multi-Source Transformer Decoder

Libovický¹,

Helcl²,

Mareċek³

2018

View full text Add to dashboard Cite

In multi-source sequence-to-sequence tasks, the attention mechanism can be modeled in several ways. This topic has been thoroughly studied on recurrent architectures. In this paper, we extend the previous work to the encoder-decoder attention in the Transformer architecture. We propose four different input combination strategies for the encoderdecoder attention: serial, parallel, flat, and hierarchical. We evaluate our methods on tasks of multimodal translation and translation with multiple source languages. The experiments show that the models are able to use multiple sources and improve over single source baselines.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jindřich Helcl

Attention Strategies for Multi-Source Sequence-to-Sequence Learning

End-to-End Non-Autoregressive Neural Machine Translation with Connectionist Temporal Classification

CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks

CUNI System for the WMT18 Multimodal Translation Task

Input Combination Strategies for Multi-Source Transformer Decoder

Contact Info

Product

Resources

About