2020
DOI: 10.48550/arxiv.2003.04983
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Understanding the Downstream Instability of Word Embeddings

Megan Leszczynski,
Avner May,
Jian Zhang
et al.

Abstract: Many industrial machine learning (ML) systems require frequent retraining to keep up-to-date with constantly changing data. This retraining exacerbates a large challenge facing ML systems today: model training is unstable, i.e., small changes in training data can cause significant changes in the model's predictions. In this paper, we work on developing a deeper understanding of this instability, with a focus on how a core building block of modern natural language processing (NLP) pipelines-pre-trained word emb… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 17 publications
(33 reference statements)
0
1
0
Order By: Relevance
“…To measure quality, Wendlandt et al [29] and Hellrich and Hahn [12] discuss analyzing word embeddings with respect to an embedding's nearest neighbors. The work of Leszczynski et al [17] is uniquely looking at the quality of an embedding with respect to a downstream task. The authors define the metric of downstream instability, the number of predictions that change with different embeddings, to measure downstream embedding instability.…”
Section: Embedding Managementmentioning
confidence: 99%
“…To measure quality, Wendlandt et al [29] and Hellrich and Hahn [12] discuss analyzing word embeddings with respect to an embedding's nearest neighbors. The work of Leszczynski et al [17] is uniquely looking at the quality of an embedding with respect to a downstream task. The authors define the metric of downstream instability, the number of predictions that change with different embeddings, to measure downstream embedding instability.…”
Section: Embedding Managementmentioning
confidence: 99%