Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.94
Revisiting the Context Window for Cross-lingual Word Embeddings

Abstract: Existing approaches to mapping-based cross-lingual word embeddings are based on the assumption that the source and target embedding spaces are structurally similar. The structures of the embedding spaces largely depend on the co-occurrence statistics of each word, which the choice of context window determines. Despite this obvious connection between the context window and mapping-based cross-lingual embeddings, their relationship has been underexplored in prior work. In this work, we provide a thorough evaluation, …
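To make the setting concrete, here is a minimal sketch (not the paper's implementation) of the mapping-based approach the abstract refers to: given two sets of monolingual vectors, whose co-occurrence statistics are fixed by the context window chosen at training time, and a small seed dictionary, the source space is aligned to the target space with an orthogonal (Procrustes) mapping. The names `src_vecs`, `trg_vecs`, and `seed_pairs` are illustrative assumptions.

```python
# Minimal sketch of mapping-based cross-lingual embeddings (illustrative only):
# align a source embedding space to a target space with an orthogonal map
# learned from seed translation pairs. `src_vecs`/`trg_vecs` are assumed to be
# dicts (or gensim KeyedVectors) mapping a word to a numpy vector.
import numpy as np

def procrustes_map(src_vecs, trg_vecs, seed_pairs):
    """Return the orthogonal matrix W that maps source vectors into the target space."""
    X = np.stack([src_vecs[s] for s, t in seed_pairs])  # source seed vectors (n x d)
    Y = np.stack([trg_vecs[t] for s, t in seed_pairs])  # target seed vectors (n x d)
    U, _, Vt = np.linalg.svd(Y.T @ X)                    # SVD of the cross-covariance
    return U @ Vt                                        # W such that W @ x ≈ y

# Usage: mapped = procrustes_map(src_vecs, trg_vecs, seeds) @ src_vecs["dog"]
# If the two spaces are not structurally similar (e.g. trained with very
# different context windows), no orthogonal W can align them well.
```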

Cited by 8 publications (4 citation statements) · References 24 publications
“…84,85 Word embeddings are created by identifying the words that occur within a “Context Window” - defined by a string of words before and after a “centre” word. 86,87 The centre word and context words are represented as a vector of numbers (word2vec) to evaluate the presence or absence of unique words in the dataset. We use word2vec v0.3.4 in R to perform training and computations.…”
Section: Methods (mentioning, confidence: 99%)
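As a small illustration of the quoted description (purely schematic, not code from the cited work), the following shows how a symmetric context window determines which (centre, context) pairs a word2vec-style model is trained on; the function name `context_pairs` and the toy sentence are assumptions.

```python
# Illustrative only: enumerate the (centre, context) pairs defined by a
# symmetric context window, i.e. the co-occurrence statistics that the
# window size controls.
def context_pairs(tokens, window):
    """Yield (centre, context) pairs for a symmetric window of the given size."""
    for i, centre in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                yield centre, tokens[j]

# With window=2, each word pairs with at most the two words on either side;
# a larger window would add more distant, topical neighbours.
print(list(context_pairs("the cat sat on the mat".split(), window=2)))
```
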
“…The difference between the Nesting and Flat languages is striking in Figure 4f. The Nesting encoders are consistently better at capturing the local contextual information (at positions −2 ∼ 2) than their flat counterparts, which may explain the better performance of the Nesting encoders in dependency parsing (Figure 3), given that the local contextual information is particularly important to predict the syntactic characteristics of words (Levy and Goldberg, 2014;Ri and Tsuruoka, 2020).…”
Section: Results (mentioning, confidence: 98%)
“…VecMap does not scale well without the use of a GPU, and hence hyperparameter searching was not done for this work. However, using vectors with 128 dimensions and a larger window size of 10 as suggested by Ri and Tsuruoka (2020) resulted in a performance decrease, even for the English news articles.…”
Section: Limitations (mentioning, confidence: 99%)
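For reference, here is a hedged sketch of the configuration the excerpt mentions (128-dimensional vectors with a window size of 10), written with gensim's word2vec rather than the citing authors' exact pipeline; corpus loading and the subsequent VecMap alignment step are assumed and omitted.

```python
# Hedged sketch, not the citing authors' code: train monolingual vectors with
# 128 dimensions and a window of 10, the setting they report as suggested by
# Ri and Tsuruoka (2020), as input to a VecMap-style alignment.
from gensim.models import Word2Vec

def train_for_mapping(sentences, window=10, dim=128):
    """Train word2vec vectors intended as input to a mapping/alignment step."""
    model = Word2Vec(
        sentences,
        vector_size=dim,   # 128-dimensional vectors, as in the excerpt above
        window=window,     # larger window (10): more topical co-occurrence statistics
        min_count=5,
        sg=1,              # skip-gram
        workers=4,
    )
    # Save in word2vec text format, which VecMap can read for alignment.
    model.wv.save_word2vec_format("src.emb")
    return model.wv
```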