Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2018
DOI: 10.18653/v1/p18-1163
Towards Robust Neural Machine Translation

Abstract: Small perturbations in the input can severely distort intermediate representations and thus impact translation quality of neural machine translation (NMT) models. In this paper, we propose to improve the robustness of NMT models with adversarial stability training. The basic idea is to make both the encoder and decoder in NMT models robust against input perturbations by enabling them to behave similarly for the original input and its perturbed counterpart. Experimental results on Chinese-English, English-German…
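
The stability idea above lends itself to a short sketch. Below is a minimal, hypothetical PyTorch-style illustration, assuming a generic encoder-decoder model with encode and decode_loss helpers (neither is from the paper's code); note the paper enforces the invariance adversarially with a discriminator, whereas this sketch substitutes a simple MSE penalty between clean and perturbed encoder states.

    import torch.nn.functional as F

    def stability_training_loss(model, src, src_perturbed, tgt, alpha=1.0):
        # Encode the original source sentence and its perturbed counterpart.
        enc_clean = model.encode(src)            # hypothetical helper
        enc_noisy = model.encode(src_perturbed)
        # Translation losses: the decoder should produce tgt from both inputs.
        loss_clean = model.decode_loss(enc_clean, tgt)
        loss_noisy = model.decode_loss(enc_noisy, tgt)
        # Invariance term: pull the perturbed representation toward the clean
        # one (the paper does this adversarially; MSE is a simplification).
        loss_inv = F.mse_loss(enc_noisy, enc_clean.detach())
        return loss_clean + loss_noisy + alpha * loss_inv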

Cited by 149 publications (143 citation statements)
References 22 publications
“…Experimental results on NIST Chinese⇒English translation show that DTMT can outperform the Transformer model by +2.09 BLEU points and achieve the best results ever reported on the same dataset. On WMT14 English⇒German and English⇒French translation, it consistently leads to substantial improvements and shows superior quality to the state-of-the-art NMT systems (Vaswani et al. 2017; Cheng et al. 2018). The main contributions of this paper can be summarized as follows:…”
Section: Introduction
confidence: 87%
“…Zhang et al. (2018) propose to exploit both left-to-right and right-to-left decoding strategies to capture bidirectional dependencies. Meng et al. (2018) propose key-value memory augmented attention to improve the adequacy of translation. Compared with them, our baseline system SHALLOWRNMT outperforms their best models by more than 3 BLEU points.…”
Section: System Description
confidence: 99%
“…For instance, one could optimize for f-BLEU or any of the other reference-less measures that we proposed, in the same way that an MT system is optimized for BLEU (either by explicitly using their scores through reinforcement learning or by simply using the metric as an early stopping criterion over a development set). Cheng et al. (2018) recently proposed an approach for training more robust MT systems, albeit in a supervised setting where noise is injected on parallel data, and the proposed solutions of Belinkov and Bisk (2018) and Anastasopoulos et al. (2019) fall within the same category. However, no approach has, to our knowledge, used GEC corpora for training MT systems robust to grammatical errors.…”
Section: Limitations and Extensions
confidence: 99%
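
The second option mentioned in the quote, treating a metric as an early stopping criterion, can be sketched as follows; train_one_interval and compute_metric are hypothetical placeholders, not APIs from any cited work.

    def train_with_metric_early_stopping(model, train_one_interval, dev_data,
                                         compute_metric, patience=5):
        # Stop once the dev-set metric (e.g. f-BLEU) has failed to improve
        # for `patience` consecutive evaluation intervals.
        best_score, stalls = float("-inf"), 0
        while stalls < patience:
            train_one_interval(model)                 # one epoch or interval
            score = compute_metric(model, dev_data)   # e.g. f-BLEU on dev set
            if score > best_score:
                best_score, stalls = score, 0         # improvement: reset
            else:
                stalls += 1
        return best_score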
“…• Translation: We use the Google Neural Machine Translation (GNMT) system for translation. We evaluated the GNMT system on the NIST MT02/03/04/05/06/08 Chinese-English sets and achieved an average BLEU score of 43.24, compared to the previous best work (43.20) (Cheng et al., 2018), yielding state-of-the-art performance.…”
Section: Experimental Setups
confidence: 96%
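
For context on the averaged score in this quote: corpus-level BLEU is computed per NIST test set and the per-set scores are then averaged. A minimal sketch using sacrebleu's corpus_bleu (how the hypotheses and multi-reference streams are loaded is left as an assumption):

    import sacrebleu

    def average_bleu(test_sets):
        # test_sets: one (hypotheses, reference_streams) pair per NIST set
        # (MT02/03/04/05/06/08); reference_streams is a list of reference
        # lists, since the NIST sets provide multiple references.
        scores = [sacrebleu.corpus_bleu(hyps, refs).score
                  for hyps, refs in test_sets]
        return sum(scores) / len(scores)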