Proceedings of the Third Conference on Machine Translation: Shared Task Papers 2018
DOI: 10.18653/v1/w18-6404

An Empirical Study of Machine Translation for the Shared Task of WMT18

Abstract: This paper describes Global Tone Communication Co., Ltd.'s submission to the WMT18 shared news translation task. We participated in the English-to-Chinese direction and achieved the best BLEU score (43.8) among all participants. The submitted system focuses on data cleaning and on techniques for building a competitive model for this task. Unlike other participants, the submitted system relies mainly on data filtering to obtain the best BLEU score. We apply data filtering not only to the provided sentences but als…

Cited by 9 publications (7 citation statements); references 6 publications (4 reference statements).
“…Due to the non-standard way of submission, the system is not considered a regular participant, but an invited/late submission and marked with " " throughout the paper. (Bei et al., 2018) GTCOM-PRIMARY is based on the Transformer "base" model architecture using the Marian toolkit, and it also applies some methods that have been proven effective in NMT systems, such as BPE, back-translation, right-to-left reranking and ensemble decoding. In this experiment, right-to-left reranking does not help.…”
Section: Alibaba (mentioning; confidence: 99%)
“…As a result, the MT field faces various data quality issues such as misalignment and incorrect translations, which may significantly impact translation quality. A straightforward solution is to apply a filtering approach, where noisy data are filtered out and a smaller subset of high-quality sentence pairs is retained (Bei et al., 2018; Junczys-Dowmunt, 2018; Rossenbach et al., 2018). Nevertheless, it is unclear whether such a filtering approach can be successfully applied to GEC, where commonly available datasets tend to be far smaller than those used in recent neural MT research.…”
Section: Related Work (mentioning; confidence: 99%)
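The filtering approach the citing papers describe (Bei et al., 2018; Junczys-Dowmunt, 2018) discards noisy sentence pairs before training. The exact rules are not given here; the following is a minimal Python sketch of one common family of such rules, length and length-ratio checks for catching misaligned pairs. The function name and thresholds are illustrative assumptions, not the authors' actual pipeline.

```python
def filter_parallel(pairs, min_ratio=0.5, max_ratio=2.0, max_len=100):
    """Keep source/target pairs that pass simple alignment-plausibility checks.

    Hypothetical sketch: drops empty or over-long sentences and pairs whose
    token-length ratio falls outside [min_ratio, max_ratio], a cheap signal
    for misalignment in crawled parallel data.
    """
    kept = []
    for src, tgt in pairs:
        src_toks, tgt_toks = src.split(), tgt.split()
        if not src_toks or not tgt_toks:
            continue  # empty side: certainly noise
        if len(src_toks) > max_len or len(tgt_toks) > max_len:
            continue  # over-long sentences are often boilerplate or concatenations
        ratio = len(src_toks) / len(tgt_toks)
        if min_ratio <= ratio <= max_ratio:
            kept.append((src, tgt))
    return kept
```

Real systems combine many such rules (language identification, punctuation balance, de-duplication) rather than length heuristics alone.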
“…We evaluated the effectiveness of our method over several GEC datasets, and found that it considerably outperformed baseline methods, including three strong denoising baselines based on a filtering approach, which is a common approach in MT (Bei et al., 2018; Junczys-Dowmunt, 2018; Rossenbach et al., 2018). We further improved the performance by applying task-specific techniques and achieved state-of-the-art performance on the CoNLL-2014, JFLEG, and BEA-2019 benchmarks.…”
Section: Introduction (mentioning; confidence: 96%)
“…The methods of data filtering by human rules are mainly the same as we did in English to Chinese (Bei et al., 2018) last year, but language models are used to clean all data, including monolingual data, parallel data and synthetic data. We use Marian to train the transformer language model for each language (i.e.…”
Section: Data Filtering (mentioning; confidence: 99%)
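The last citation statement describes cleaning data by scoring it with per-language language models trained in Marian. The Marian-specific pipeline is not reproduced here; below is a self-contained Python sketch of the underlying idea, scoring sentences with a language model and keeping only those under a perplexity threshold. For simplicity the sketch uses an add-one-smoothed unigram model in place of a transformer LM; all names and the threshold are illustrative assumptions.

```python
import math
from collections import Counter

def train_unigram(corpus):
    """Train an add-one-smoothed unigram LM from an iterable of sentences."""
    counts = Counter(tok for sent in corpus for tok in sent.split())
    total = sum(counts.values())
    vocab = len(counts) + 1  # +1 reserves mass for unseen tokens
    return counts, total, vocab

def perplexity(sentence, model):
    """Per-token perplexity of a sentence under the unigram model."""
    counts, total, vocab = model
    toks = sentence.split()
    if not toks:
        return float("inf")
    log_prob = sum(math.log((counts[t] + 1) / (total + vocab)) for t in toks)
    return math.exp(-log_prob / len(toks))

def lm_filter(sentences, model, threshold):
    """Keep sentences the LM finds plausible (perplexity <= threshold)."""
    return [s for s in sentences if perplexity(s, model) <= threshold]
```

In practice the same scheme applies to parallel and synthetic data by scoring each side with its language's LM; a transformer LM simply replaces the unigram probabilities with contextual ones.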