Proceedings of the 8th Workshop on Asian Translation (WAT2021), 2021
DOI: 10.18653/v1/2021.wat-1.11
Zero-pronoun Data Augmentation for Japanese-to-English Translation

Abstract: For Japanese-to-English translation, zero pronouns in Japanese pose a challenge, since the model needs to infer and produce the corresponding pronoun in the target side of the English sentence. However, although fully resolving zero pronouns often needs discourse context, in some cases, the local context within a sentence gives clues to the inference of the zero pronoun. In this study, we propose a data augmentation method that provides additional training signals for the translation model to learn correlation…
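The abstract is truncated, but the method it outlines, deriving extra training pairs in which a recoverable Japanese pronoun has been deleted so the model must still produce the English pronoun, can be sketched in a few lines. The following Python sketch is illustrative only: the pronoun list, the function augment_pair, and the drop-one-pronoun heuristic are assumptions for exposition, not the paper's actual implementation.

# Hypothetical sketch of zero-pronoun data augmentation: from a
# Japanese-English parallel pair, derive an extra pair in which an overt
# Japanese subject pronoun is dropped, simulating a zero pronoun while the
# English side keeps the pronoun as the training signal.

# Illustrative (incomplete) list of overt pronouns with their particles.
PRONOUN_PATTERNS = ["私は", "私が", "彼は", "彼が", "彼女は", "彼女が", "あなたは"]

def augment_pair(ja: str, en: str) -> list[tuple[str, str]]:
    """Return the original pair plus a pseudo zero-pronoun variant, if any."""
    pairs = [(ja, en)]
    for pattern in PRONOUN_PATTERNS:
        if pattern in ja:
            # Delete the pronoun (and its particle) once; the local context
            # left in the sentence is what the model must learn to exploit.
            pairs.append((ja.replace(pattern, "", 1), en))
            break
    return pairs

if __name__ == "__main__":
    for src, tgt in augment_pair("私は学校に行った。", "I went to school."):
        print(src, "->", tgt)

Applied over a parallel corpus, this kind of augmentation adds zero-pronoun-style examples without touching the MT architecture, which matches the data-level framing in the citation statements below.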

Cited by 2 publications (6 citation statements)
References 17 publications (11 reference statements)

“…Language Bias. Most works used Chinese and Japanese datasets as testbed for training ZP models (Song et al., 2020; Ri et al., 2021). However, there were limited data available for other prodrop languages (e.g.…”
Section: Discussion and Findings (mentioning)
confidence: 99%
“…They trained an external model on the ZP data to recover the ZP information in the input sequence of the MT model (Tan et al., 2019; Ohtani et al., 2019; Tan et al., 2021) or correct the errors in the translation outputs (Voita et al., 2019). Others aimed to up-sample the training data for the ZPT task (Sugiyama and Yoshinaga, 2019; Kimura et al., 2019; Ri et al., 2021). They preferred to improve the ZPT performance via a data augmentation without modifying the MT architecture (Wang et al., 2016a; Sugiyama and Yoshinaga, 2019).…”
Section: Data-level Methods Do Not Change Model (mentioning)
confidence: 99%
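As a contrast to the augmentation sketch above, the first strategy in this quote, recovering ZP information in the input sequence with an external model, can be illustrated schematically. The resolver below is a stand-in heuristic, not any cited system; only the interface (annotated source in, unmodified MT model downstream) is the point.

# Schematic only: an external resolver re-inserts a predicted pronoun into
# the source before translation, so the MT model itself is unchanged.

def resolve_zero_pronoun(ja: str) -> str:
    """Stand-in for an external ZP recovery model (purely illustrative)."""
    # A real resolver would predict the antecedent; here we just tag an
    # assumed first-person subject when no overt subject pronoun is found.
    if not ja.startswith(("私", "彼", "彼女", "あなた")):
        return "<zp>私</zp>" + ja
    return ja

def translate(ja: str, mt) -> str:
    """Only the input changes; the MT architecture is untouched."""
    return mt(resolve_zero_pronoun(ja))

if __name__ == "__main__":
    dummy_mt = lambda s: f"[translated] {s}"  # placeholder for a real model
    print(translate("学校に行った。", dummy_mt))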