Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, 2017
DOI: 10.18653/v1/e17-2060
How Grammatical is Character-level Neural Machine Translation? Assessing MT Quality with Contrastive Translation Pairs

Abstract: Analysing translation quality with regard to specific linguistic phenomena has historically been difficult and time-consuming. Neural machine translation has the attractive property that it can produce scores for arbitrary translations, and we propose a novel method to assess how well NMT systems model specific linguistic phenomena such as agreement over long distances, the production of novel words, and the faithful translation of polarity. The core idea is that we measure whether a reference translation is mo…
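As a rough illustration of the scoring idea described in the abstract, the sketch below counts how often a model assigns the reference translation a higher score than a contrastive variant. Here `score_log_prob` is a hypothetical stand-in for whatever conditional log-probability an NMT model exposes; it is not an interface defined in the paper.

```python
# Minimal sketch of contrastive-pair evaluation: the model "passes" an item
# when it scores the reference above the contrastive (erroneous) translation.

def contrastive_accuracy(pairs, score_log_prob):
    """pairs: iterable of (source, reference, contrastive) triples.
    Returns the fraction of pairs where the model prefers the reference."""
    correct = 0
    total = 0
    for source, reference, contrastive in pairs:
        ref_score = score_log_prob(source, reference)
        con_score = score_log_prob(source, contrastive)
        if ref_score > con_score:  # model ranks the correct translation higher
            correct += 1
        total += 1
    return correct / total if total else 0.0
```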

Cited by 108 publications (132 citation statements); references 16 publications (28 reference statements).
“…Our goal, instead, is to analyze the behavior of the MT system when confronted with ungrammatical input. Reference-less evaluation has also been proposed for text simplification (Martin et al., 2018) and GEC (Napoles et al., 2016), while the grammaticality of MT systems' outputs has been evaluated with target-side contrastive pairs (Sennrich, 2017). In this work, the core of our evaluation of a system's robustness lies in the following observation: a perfectly robust-to-noise MT system would produce the exact same output for the clean and erroneous versions of the same input sentence.…”
Section: Evaluating NMT Robustness Without References
Mentioning (confidence: 99%)
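A minimal sketch of the robustness observation quoted above, assuming a hypothetical `translate` function that decodes a single sentence: it simply measures how often the clean and noisy versions of an input yield string-identical outputs. This is an illustration of the stated idea, not code from the cited work.

```python
# Consistency check: a perfectly robust system translates the clean and the
# noisy version of each sentence identically.

def consistency_rate(clean_sentences, noisy_sentences, translate):
    """Fraction of sentence pairs whose translations are string-identical."""
    assert len(clean_sentences) == len(noisy_sentences)
    if not clean_sentences:
        return 0.0
    same = sum(
        translate(clean) == translate(noisy)
        for clean, noisy in zip(clean_sentences, noisy_sentences)
    )
    return same / len(clean_sentences)
```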
“…Here, we seek to answer this question by testing our models on Lingeval97 (Sennrich, 2017), a test set which provides contrastive translation pairs for different types of errors. For the example of subject-verb agreement, contrastive translations are created from a reference translation by changing the grammatical number of the verb, and we can measure how often the NMT model prefers the correct reference over the contrastive variant.…”
Section: Error Analysis
Mentioning (confidence: 99%)
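For concreteness, the sketch below shows what a subject-verb agreement contrastive pair could look like and how a model's preference would be checked. The German toy example and the `score_log_prob` function are illustrative assumptions, not items taken from the actual Lingeval97 test set.

```python
# Toy subject-verb agreement item: the contrastive variant flips the
# grammatical number of the verb, as described in the quote above.

source = "The children are playing in the garden ."
reference = "Die Kinder spielen im Garten ."    # plural verb agrees with subject
contrastive = "Die Kinder spielt im Garten ."   # verb number flipped (ungrammatical)

def prefers_reference(source, reference, contrastive, score_log_prob):
    """True if the model scores the reference above the contrastive variant,
    i.e. it handles the agreement phenomenon in this instance."""
    return score_log_prob(source, reference) > score_log_prob(source, contrastive)
```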
“…Unsupervised NMT. Current NMT systems (Sutskever et al., 2014; Cho et al., 2014a; Bahdanau et al., 2015; Gehring et al., 2017; Vaswani et al., 2017) are known to overfit easily and to deliver inferior performance when the training data is limited (Koehn and Knowles, 2017; Isabelle et al., 2017; Sennrich, 2017). Much research effort has been devoted to utilizing monolingual data to improve NMT systems when only limited supervision is available (Gulcehre et al., 2015; Sennrich et al., 2016a; He et al., 2016; Zhang and Zong, 2016).…”
Section: Related Work
Mentioning (confidence: 99%)