Proceedings of the 55th Annual Meeting of the Association For Computational Linguistics (Volume 1: Long Papers) 2017
DOI: 10.18653/v1/p17-1175
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation

Abstract: We introduce a Multi-modal Neural Machine Translation model in which a doubly-attentive decoder naturally incorporates spatial visual features obtained using pre-trained convolutional neural networks, bridging the gap between image description and translation. Our decoder learns to attend to source-language words and parts of an image independently by means of two separate attention mechanisms as it generates words in the target language. We find that our model can efficiently exploit not just back-translated …
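The abstract describes two independent attention mechanisms, one over source-language words and one over spatial image regions, whose context vectors both feed the decoder at each step. Below is a minimal sketch of that idea in PyTorch; the class name, dimensions, and the additive (Bahdanau-style) attention form are illustrative assumptions rather than the authors' exact implementation.

    # Sketch only: two independent soft attentions feeding one decoder step.
    # All names and dimensions are hypothetical, not the paper's code.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class DoublyAttentiveDecoderStep(nn.Module):
        def __init__(self, hid_dim, src_dim, img_dim, emb_dim, vocab_size):
            super().__init__()
            # One additive attention per modality (source words, image regions).
            self.src_query = nn.Linear(hid_dim, src_dim)
            self.src_score = nn.Linear(src_dim, 1)
            self.img_query = nn.Linear(hid_dim, img_dim)
            self.img_score = nn.Linear(img_dim, 1)
            self.rnn = nn.GRUCell(emb_dim + src_dim + img_dim, hid_dim)
            self.out = nn.Linear(hid_dim, vocab_size)

        def attend(self, query, keys, score_layer, proj_layer):
            # keys: (batch, n, dim); query: (batch, hid_dim)
            energy = score_layer(torch.tanh(keys + proj_layer(query).unsqueeze(1)))
            alpha = F.softmax(energy.squeeze(-1), dim=-1)          # (batch, n)
            return torch.bmm(alpha.unsqueeze(1), keys).squeeze(1)  # (batch, dim)

        def forward(self, y_emb, h_prev, src_states, img_feats):
            # The two attentions are computed independently from the same state.
            src_ctx = self.attend(h_prev, src_states, self.src_score, self.src_query)
            img_ctx = self.attend(h_prev, img_feats, self.img_score, self.img_query)
            h = self.rnn(torch.cat([y_emb, src_ctx, img_ctx], dim=-1), h_prev)
            return self.out(h), h

At each target position the decoder consumes the previous word embedding plus one context vector per modality, so it can weight source words and image regions independently, which is the core of the doubly-attentive design.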

Cited by 131 publications (99 citation statements) · References 34 publications

Citation statements, ordered by relevance:
“…First, Section 3.1 looks at NMT for e-commerce, describing important parts of a more extended study that is reported in detail in Calixto et al (2017b). The second use case (Section 3.2) is an evaluation performed by Iconic Translation Machines Ltd.…”
Section: Use Cases (mentioning)
confidence: 99%
“…MT Systems -Three different systems were compared in this experiment: (1) a PBSMT baseline model built with the Moses SMT Toolkit (Koehn et al, 2007), (2) a text-only NMT model (NMT_t), and (3) a multi-modal NMT model (NMT_m), described in more detail in Calixto et al (2017b), which expands upon the text-only attention-based model and introduces a visual component to incorporate local visual features.…”
Section: NMT for E-commerce Product Listing (mentioning)
confidence: 99%
“…By using an ensemble of four different multimodal NMT models trained on the translated Multi30k training data, we were able to obtain translations comparable to or even better than those obtained with the strong multi-modal NMT model of Calixto et al (2017a), which is pretrained on large amounts of WMT data and uses local image features.…”
Section: Results (mentioning)
confidence: 96%
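The ensembling this statement refers to combines the per-step predictions of several independently trained models. A minimal sketch of probability averaging, assuming each model exposes a hypothetical step() returning log-probabilities over the target vocabulary; the cited paper's exact ensembling scheme may differ.

    import torch

    def ensemble_step(models, states, y_prev):
        # Average per-step word probabilities across models; return the
        # combined log-probabilities and the updated per-model decoder states.
        probs, new_states = [], []
        for model, state in zip(models, states):
            log_p, new_state = model.step(y_prev, state)  # hypothetical interface
            probs.append(log_p.exp())
            new_states.append(new_state)
        return torch.log(torch.stack(probs).mean(dim=0)), new_states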
“…Table 3: Results for the best model of Calixto et al (2017a), which is pre-trained on the English-German WMT 2015 (Bojar et al, 2015), and different combinations of multi-modal models, all trained on the original M30k_T training data only, evaluated on the M30k_T 2016 test set.…”
Section: Results (mentioning)
confidence: 99%