Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)
DOI: 10.18653/v1/n19-1188

Learning Unsupervised Multilingual Word Embeddings with Incremental Multilingual Hubs

Abstract: Recent research has discovered that a shared bilingual word embedding space can be induced by projecting monolingual word embedding spaces from two languages using a self-learning paradigm without any bilingual supervision. However, it has also been shown that for distant language pairs such fully unsupervised self-learning methods are unstable and often get stuck in poor local optima due to reduced isomorphism between the starting monolingual spaces. In this work, we propose a new robust framework for learning unsupervised…
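The self-learning paradigm the abstract refers to alternates between two steps: fit an orthogonal mapping on the current (possibly noisy) dictionary, then re-induce the dictionary by nearest-neighbour search in the shared space. Below is a minimal sketch of that loop, assuming length-normalized embedding rows, a given seed dictionary, and a fixed iteration count; it illustrates the paradigm, not the paper's exact procedure.

import numpy as np

def self_learning(X, Y, seed_pairs, n_iters=10):
    """X: (n_src, d) and Y: (n_tgt, d) monolingual embeddings (rows
    length-normalized); seed_pairs: initial (src_idx, tgt_idx) dictionary."""
    pairs = list(seed_pairs)
    W = np.eye(X.shape[1])
    for _ in range(n_iters):
        src = np.array([s for s, _ in pairs])
        tgt = np.array([t for _, t in pairs])
        # Procrustes step: best orthogonal W aligning X[src] with Y[tgt].
        U, _, Vt = np.linalg.svd(X[src].T @ Y[tgt])
        W = U @ Vt
        # Dictionary step: re-induce pairs by nearest neighbours in the shared space.
        sims = (X @ W) @ Y.T
        pairs = [(i, int(sims[i].argmax())) for i in range(X.shape[0])]
    return W, pairs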

Cited by 22 publications (18 citation statements) · References 25 publications
“…Therefore, follow-up work aimed to improve the robustness of unsupervised CLWE induction by introducing more robust self-learning procedures (Artetxe et al., 2018b; Kementchedjhieva et al., 2018). Besides increased robustness, recent work claims that fully unsupervised projection-based CLWEs can even match or surpass their supervised counterparts (Artetxe et al., 2018b; Alvarez-Melis and Jaakkola, 2018; Hoshen and Wolf, 2018; Heyman et al., 2019).…”
Section: Introduction (mentioning)
confidence: 99%
“…We solve the above two stages sequentially using known techniques. Our methodology contrasts with existing unsupervised MWE methods (Alaux et al., 2019; Chen and Cardie, 2018; Heyman et al., 2019), which learn the unsupervised word alignments and the cross-lingual word embedding mappings jointly. Despite its apparent simplicity, we empirically observe that the proposed approach exhibits remarkable generalization ability and robustness.…”
Section: Unsupervised Multilingual Multi-stage Framework (mentioning)
confidence: 99%
“…They obtain the bilingual lexicons using the Gromov-Wasserstein approach (Alvarez-Melis and Jaakkola, 2018) and mapping operators between languages using the RCSLS algorithm (Joulin et al., 2018). Heyman et al. (2019) propose to learn the shared multilingual space by incrementally adding languages to it, one per iteration. Their approach is based on a reformulation of the bilingual self-learning algorithm proposed by Artetxe et al. (2018b).…”
Section: Introduction (mentioning)
confidence: 99%
“…Wada et al (2019) instead use a sentence-level neural language model for directly learning multilingual word embeddings and as a result bypassing the need for mapping functions. In the paradigm of aligning pre-trained word embeddings where we focus, Heyman et al (2019) propose a technique that iteratively builds a multilingual space starting from a monolingual space and incrementally incorporating languages to it. Even if this strategy deviates from the traditional TB/MP model, it still preserves the idea of having a pivot language.…”
Section: Related Workmentioning
confidence: 99%
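The incremental construction described in the last two statements can be summarized in a short sketch: seed the hub with one language, align each new language to the current hub with an unsupervised bilingual aligner, and fold the mapped vectors into the hub. This illustrates the idea, not the authors' released code; align_fn is an assumed interface returning a linear map from a new space into the hub.

import numpy as np

def build_hub_incrementally(spaces, align_fn):
    """spaces: list of (n_i, d) monolingual embedding matrices; spaces[0]
    seeds the hub. align_fn(X, hub) is an assumed unsupervised bilingual
    aligner returning a (d, d) mapping of X into the hub space."""
    hub = spaces[0]
    mapped = [spaces[0]]
    for X in spaces[1:]:
        W = align_fn(X, hub)              # bilingual alignment to the current hub
        X_mapped = X @ W
        mapped.append(X_mapped)
        hub = np.vstack([hub, X_mapped])  # the hub now also contains this language
    return mapped

For example, reusing self_learning from the first sketch, one could pass align_fn=lambda X, hub: self_learning(X, hub, seed_fn(X, hub))[0], where seed_fn supplies an unsupervised seed dictionary (another assumption).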