Proceedings of the Fourth Workshop on Computational Models of Reference, Anaphora and Coreference 2021
DOI: 10.18653/v1/2021.crac-1.9
Data Augmentation Methods for Anaphoric Zero Pronouns

Abstract: In pro-drop languages such as Arabic, Chinese, Italian, Japanese, Spanish, and many others, unrealized (null) arguments in certain syntactic positions can refer to a previously introduced entity and are thus called anaphoric zero pronouns. However, the existing resources for studying anaphoric zero pronoun interpretation are still limited. In this paper, we use five data augmentation methods to generate and detect anaphoric zero pronouns automatically. We use the augmented data as additional training materials for…
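The abstract above is truncated, so the paper's five augmentation methods are not listed here. As a hedged illustration of the general idea only, the sketch below implements one plausible strategy, pronoun dropping: delete an overt pronoun and record its position as a synthetic anaphoric-zero-pronoun (AZP) site. The `PRONOUNS` list and the `make_azp_examples` helper are hypothetical, not taken from the paper.

```python
# Minimal sketch of one plausible AZP augmentation strategy (an assumption,
# not necessarily one of the paper's five methods): drop an overt pronoun
# and record its position as a synthetic zero-pronoun site.

PRONOUNS = {"he", "she", "it", "they"}  # toy list; real systems would use a parser


def make_azp_examples(tokens):
    """Yield (augmented_tokens, azp_index) pairs, one per dropped pronoun."""
    for i, tok in enumerate(tokens):
        if tok.lower() in PRONOUNS:
            # The gap at position i becomes the synthetic AZP site.
            yield tokens[:i] + tokens[i + 1:], i


sent = "Maria bought a book and she read it overnight".split()
for aug, idx in make_azp_examples(sent):
    print(idx, " ".join(aug))
```

Each yielded pair provides both a positive training instance for AZP detection and a recoverable antecedent (the dropped pronoun), which is what makes this family of strategies attractive for low-resource pro-drop languages.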

Cited by 6 publications (8 citation statements). References 41 publications.

Citation statements, ordered by relevance:
“…Due to page limitation, some examples are mainly discussed in Chinese and/or English. However, most results and findings can be applied to other pro-drop languages, which is further supported by other works (Ri et al., 2021; Aloraini and Poesio, 2020; Vincent et al., 2022). In Appendix §A.1, we add details on the phenomenon in various pro-drop languages such as Arabic, Swahili, Portuguese, Hindi, and Japanese.…”
Section: Limitations (supporting, confidence: 79%)
“…ZPT is a hard task to tackle alone, so researchers are investigating how to leverage other related NLP tasks to improve ZPT by training models to perform multiple tasks simultaneously (Wang et al., 2018a). Since ZPT is a cross-lingual problem, researchers are also exploring techniques for training models that work across multiple languages, rather than being limited to a single language (Aloraini and Poesio, 2020).…”
Section: Data-level Methods Do Not Change Model (mentioning, confidence: 99%)
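To make the multi-task idea in the statement above concrete, here is a minimal sketch assuming a shared encoder with a zero-pronoun tagging head and an auxiliary token-prediction head whose losses are summed. All module names, sizes, and the equal loss weighting are toy assumptions, not the cited systems' design.

```python
# Toy multi-task setup: one shared encoder, two task heads, summed losses.
import torch
import torch.nn as nn


class MultiTaskZP(nn.Module):
    """Shared encoder with an AZP tagging head and an auxiliary head."""

    def __init__(self, vocab=1000, dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.encoder = nn.GRU(dim, dim, batch_first=True)  # shared representation
        self.zp_head = nn.Linear(dim, 2)       # is this token position an AZP site?
        self.aux_head = nn.Linear(dim, vocab)  # auxiliary token prediction

    def forward(self, ids):
        h, _ = self.encoder(self.embed(ids))
        return self.zp_head(h), self.aux_head(h)


model = MultiTaskZP()
ids = torch.randint(0, 1000, (2, 7))  # toy batch: 2 sentences, 7 tokens each
zp_logits, aux_logits = model(ids)
zp_labels = torch.randint(0, 2, (2, 7))
aux_labels = torch.randint(0, 1000, (2, 7))
loss = (nn.functional.cross_entropy(zp_logits.reshape(-1, 2), zp_labels.reshape(-1))
        + nn.functional.cross_entropy(aux_logits.reshape(-1, 1000), aux_labels.reshape(-1)))
loss.backward()  # gradients flow into both heads and the shared encoder
```

The point of the shared encoder is that supervision from the auxiliary task shapes the representations the zero-pronoun head sees, which is the mechanism the citing work appeals to.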
“…Another mention-pair model that was developed for the Persian language extracted hand-crafted, embedding-based, and rich semantic features of mentions and used them as input to a fully connected neural network for coreference resolution (Sahlani et al., 2020). The adaptation of an English mention-ranking model (Lee et al., 2008) to Arabic was enhanced with performance-related improvements such as the heuristic-based preprocessing of words and the use of a separately trained mention detection approach (Aloraini et al., 2020). A Siamese network architecture and an extended feature set of mentions were used for Polish coreference resolution (Niton et al., 2018).…”
Section: Introduction (mentioning, confidence: 99%)
“…In the first model, a set of well-studied features from the existing literature (Bengtson and Roth, 2008; Durrett and Klein, 2013; Wiseman et al., 2015) is extracted for a mention and its candidate antecedents and then fed to a single-layer feed-forward neural network as input. Our second model closely follows the mention-ranking approach of the end-to-end coreference solution proposed by Lee et al. (2007), which was successfully applied to other languages including Arabic (Aloraini et al., 2020) and Slovenian (Klemen and Žitnik, 2022). The contextual representations of a mention and its candidate antecedent mentions are learned from pre-trained language models, and a probability distribution is obtained over all possible pairings of the mention with candidate antecedents using a two-layer feed-forward network.…”
Section: Introduction (mentioning, confidence: 99%)
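The scoring step described in this last statement can be sketched as follows: concatenate a mention's contextual vector with each candidate antecedent's vector (plus their elementwise product, a common feature choice), score each pair with a two-layer feed-forward network, and normalize over the candidates together with a dummy "no antecedent" option. The dimensions, the pair features, and the zero-scored dummy are illustrative assumptions, not the cited implementations.

```python
# Hedged sketch of a mention-ranking scorer: a two-layer FFNN produces a
# distribution over candidate antecedents plus a dummy "no antecedent".
import torch
import torch.nn as nn


class PairScorer(nn.Module):
    def __init__(self, dim=768, hidden=150):
        super().__init__()
        # Input per pair: [mention; antecedent; elementwise product]
        self.ffnn = nn.Sequential(
            nn.Linear(3 * dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, mention, antecedents):
        # mention: (dim,)   antecedents: (n_candidates, dim)
        m = mention.expand_as(antecedents)
        pairs = torch.cat([m, antecedents, m * antecedents], dim=-1)
        scores = self.ffnn(pairs).squeeze(-1)  # (n_candidates,)
        dummy = torch.zeros(1)                 # fixed score 0 for "no antecedent"
        return torch.log_softmax(torch.cat([dummy, scores]), dim=0)


scorer = PairScorer()
mention = torch.randn(768)        # e.g. a pooled span vector from a pre-trained LM
candidates = torch.randn(5, 768)  # five earlier mentions in the document
print(scorer(mention, candidates))  # log-distribution over dummy + 5 antecedents
```

Training then maximizes the probability mass assigned to the gold antecedents (or to the dummy for non-anaphoric mentions), which is what "a probability distribution over all possible pairings" amounts to in practice.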