2018
DOI: 10.48550/arxiv.1806.06259
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Evaluation of sentence embeddings in downstream and linguistic probing tasks

Abstract: Despite the fast developmental pace of new sentence embedding methods, it is still challenging to find comprehensive evaluations of these different techniques. In the past years, we saw significant improvements in the field of sentence embeddings and especially towards the development of universal sentence encoders that could provide inductive transfer to a wide variety of downstream tasks. In this work, we perform a comprehensive evaluation of recent methods using a wide variety of downstream and linguistic f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
12
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 12 publications
(13 citation statements)
references
References 22 publications
1
12
0
Order By: Relevance
“…To obtain sentence embeddings from individual words, we perform a weighted average of the word embeddings and use the TF-IDF scores of individual words as weight factors. It is a simple yet effective method to obtain sentence embedding for downstream tasks, as noted by previous work [28], [29]. This is shown in detail as Equation 2, where w j is the vector encoding the GloVe embedding of word x j :…”
Section: B Api Representation Learning and Matchingmentioning
confidence: 99%
“…To obtain sentence embeddings from individual words, we perform a weighted average of the word embeddings and use the TF-IDF scores of individual words as weight factors. It is a simple yet effective method to obtain sentence embedding for downstream tasks, as noted by previous work [28], [29]. This is shown in detail as Equation 2, where w j is the vector encoding the GloVe embedding of word x j :…”
Section: B Api Representation Learning and Matchingmentioning
confidence: 99%
“…The standard validation is employed. Being Different from the work in [61] that uses logistic regression for the WC task in the category of surface information, we use the same MLP model to provide simple yet fair comparison.…”
Section: Probing Tasksmentioning
confidence: 99%
“…One of the major contributions is called word embedding. There are various types of word embedding in the literature that is well covered by Perone et al (2018).…”
Section: Related Workmentioning
confidence: 99%