Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019) 2019
DOI: 10.18653/v1/w19-4318
|View full text |Cite
|
Sign up to set email alerts
|

Probing Multilingual Sentence Representations With X-Probe

Abstract: This paper extends the task of probing sentence representations for linguistic insight in a multilingual domain. In doing so, we make two contributions: first, we provide datasets for multilingual probing, derived from Wikipedia, in five languages, viz. English, French, German, Spanish and Russian. Second, we evaluate six sentence encoders for each language, each trained by mapping sentence representations to English sentence representations, using sentences in a parallel corpus. We discover that cross-lingual… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
20
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(20 citation statements)
references
References 43 publications
0
20
0
Order By: Relevance
“…2 The benchmark covers five languages: English, French, German, Spanish and Russian, derived from Wikipedia. The task set comprises 9 probing tasks, summarized in Table 1, that address varieties of linguistic properties including surface, syntactic, and semantic information Ravishankar et al, 2019). Ravishankar et al (2019) used the datasets to evaluate different sentence encoders trained by mapping sentence representations to English.…”
Section: Experimental Setups and Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…2 The benchmark covers five languages: English, French, German, Spanish and Russian, derived from Wikipedia. The task set comprises 9 probing tasks, summarized in Table 1, that address varieties of linguistic properties including surface, syntactic, and semantic information Ravishankar et al, 2019). Ravishankar et al (2019) used the datasets to evaluate different sentence encoders trained by mapping sentence representations to English.…”
Section: Experimental Setups and Resultsmentioning
confidence: 99%
“…The task set comprises 9 probing tasks, summarized in Table 1, that address varieties of linguistic properties including surface, syntactic, and semantic information Ravishankar et al, 2019). Ravishankar et al (2019) used the datasets to evaluate different sentence encoders trained by mapping sentence representations to English. Unlike Ravishankar et al ( 2019), we use the datasets to evaluate DCT embeddings for each language independently.…”
Section: Experimental Setups and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The majority of approaches for probing sentence embeddings target English, but recently some works have also addressed other languages such as Polish, Russian, or Spanish in a multiand cross-lingual setup (Krasnowska-Kieraś and Wróblewska, 2019;Ravishankar et al, 2019). Motivations for considering a multi-lingual analysis include knowing whether findings from English transfer to other languages and determining a universal set of probing tasks that suits multiple languages, e.g., with richer morphology and freer word order.…”
Section: Introductionmentioning
confidence: 99%