Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019
DOI: 10.18653/v1/d19-1084
A Discriminative Neural Model for Cross-Lingual Word Alignment

Abstract: We introduce a novel discriminative word alignment model, which we integrate into a Transformer-based machine translation model. In experiments based on a small number of labeled examples (∼1.7K-5K sentences) we evaluate its performance intrinsically on both English-Chinese and English-Arabic alignment, where we achieve major improvements over unsupervised baselines (11-27 F1). We evaluate the model extrinsically on data projection for Chinese NER, showing that our alignments lead to higher performance when us…

Cited by 19 publications (22 citation statements)
References 28 publications
“…Li et al (2019) propose two methods to extract alignments from NMT models; however, they do not outperform fast-align. Stengel-Eskin et al (2019) compute similarity matrices of encoder-decoder representations that are leveraged for word alignment, together with supervised learning, which requires manually annotated alignments. We find our proposed methods to be competitive with these approaches.…”
Section: Part-of-speech Analysis
confidence: 99%
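The similarity-matrix approach mentioned above can be illustrated with a minimal sketch. This is not the paper's actual model: the function name, the use of cosine similarity, and the greedy per-target-word decoding are illustrative assumptions; the real model learns the scoring and decoding from annotated alignments.

```python
import numpy as np

def alignment_from_similarity(enc_states, dec_states, threshold=0.0):
    """Hypothetical sketch: derive word-alignment links from a
    similarity matrix between encoder and decoder hidden states.

    enc_states: (src_len, d) source-side representations
    dec_states: (tgt_len, d) target-side representations
    Returns a list of (tgt_idx, src_idx) alignment links.
    """
    # Normalize rows so the dot product becomes cosine similarity.
    enc = enc_states / np.linalg.norm(enc_states, axis=1, keepdims=True)
    dec = dec_states / np.linalg.norm(dec_states, axis=1, keepdims=True)
    # Similarity matrix of shape (tgt_len, src_len).
    sim = dec @ enc.T
    # Greedy decoding: link each target word to its best source word,
    # skipping words whose best score falls below the threshold.
    return [(t, int(np.argmax(sim[t]))) for t in range(sim.shape[0])
            if sim[t].max() > threshold]

# Toy example: decoder states copied from encoder rows 0, 1, 1,
# so each target word should align to the row it was copied from.
rng = np.random.default_rng(0)
enc = rng.normal(size=(2, 4))
links = alignment_from_similarity(enc, enc[[0, 1, 1]])
```

In this toy setup the greedy argmax recovers the copy pattern, yielding `[(0, 0), (1, 1), (2, 1)]`.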
“…It formalizes word alignment as a collection of SQuAD-style span prediction problems (Rajpurkar et al, 2016) and solves them with multilingual BERT (Devlin et al, 2019). We experimentally show that our proposed model significantly outperformed both (Garg et al, 2019) and (Stengel-Eskin et al, 2019).…”
Section: Introduction
confidence: 90%
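The span-prediction formulation in the statement above can be sketched in a few lines. This is an illustrative reduction only: the helper name and the input format are assumptions, and the real system predicts the spans with multilingual BERT rather than receiving them as input.

```python
def spans_to_links(span_predictions):
    """Hypothetical sketch: convert per-source-word SQuAD-style span
    predictions into word-level alignment links.

    span_predictions: dict mapping a source index to a predicted
    (start, end) target span (inclusive), or None for unaligned words.
    Returns a list of (src_idx, tgt_idx) alignment links.
    """
    links = []
    for src_idx, span in span_predictions.items():
        if span is None:
            continue  # the model predicted "no answer" for this word
        start, end = span
        links.extend((src_idx, tgt) for tgt in range(start, end + 1))
    return links

# Source word 0 aligns to target span [0, 1], word 1 is unaligned,
# word 2 aligns to the single target word 2.
links = spans_to_links({0: (0, 1), 1: None, 2: (2, 2)})
```

One span prediction per source word naturally yields one-to-many links, which is why the formulation is run in both directions and symmetrized in practice.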
“…Most previous works that use them for word alignment (Yang et al, 2013; Tamura et al, 2014; Legrand et al, 2016) achieved accuracies basically comparable to GIZA++. However, recent works (Garg et al, 2019; Stengel-Eskin et al, 2019; Zenkel et al, 2020) based on the Transformer (Vaswani et al, 2017), the state-of-the-art neural machine translation model, have started to outperform GIZA++. Garg et al (2019) made the attention of the Transformer more closely resemble the word alignment, and achieved better accuracy than GIZA++ when alignments obtained from it were used for supervision.…”
Section: Introduction
confidence: 99%
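A common baseline behind the attention-based aligners discussed above is to read alignments directly off attention weights and intersect the two translation directions for precision. The sketch below is a generic illustration of that idea, not any cited paper's method; the function name and the simple argmax-plus-intersection decoding are assumptions.

```python
import numpy as np

def attention_alignment(attn, reverse_attn):
    """Hypothetical sketch: extract alignment links by intersecting
    the argmaxes of forward and backward attention matrices.

    attn: (tgt_len, src_len) target-to-source attention weights
    reverse_attn: (src_len, tgt_len) source-to-target attention weights
    Returns sorted (tgt_idx, src_idx) links present in both directions.
    """
    # Forward direction: each target word picks its best source word.
    fwd = {(t, int(np.argmax(attn[t]))) for t in range(attn.shape[0])}
    # Backward direction: each source word picks its best target word.
    bwd = {(int(np.argmax(reverse_attn[s])), s)
           for s in range(reverse_attn.shape[0])}
    # Keep only links agreed on by both directions (high precision).
    return sorted(fwd & bwd)

# Toy 2x2 example where both directions agree on the diagonal.
attn = np.array([[0.9, 0.1], [0.2, 0.8]])
reverse_attn = np.array([[0.7, 0.3], [0.4, 0.6]])
links = attention_alignment(attn, reverse_attn)
```

Here both directions prefer the diagonal, so the intersection is `[(0, 0), (1, 1)]`; the supervised approaches cited above improve on exactly this kind of heuristic decoding.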