Proceedings of the 1st Workshop on Representation Learning for NLP 2016
DOI: 10.18653/v1/w16-1612
Mapping Unseen Words to Task-Trained Embedding Spaces

Abstract: We consider the supervised training setting in which we learn task-specific word embeddings. We assume that we start with initial embeddings learned from unlabelled data and update them to learn task-specific embeddings for words in the supervised training data. However, for new words in the test set, we must use either their initial embeddings or a single unknown embedding, which often leads to errors. We address this by learning a neural network to map from initial embeddings to the task-specific embedding sp…
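The mapping idea in the abstract can be illustrated with a minimal sketch. The paper itself trains a neural network; here a plain linear least-squares map stands in for it, on synthetic data, purely to show the shape of the approach (all names and dimensions are hypothetical, not the authors' setup):

```python
import numpy as np

# Hypothetical sketch: learn a map from initial (unlabelled-data) embeddings
# to task-trained embeddings, then apply it to words unseen in supervised
# training instead of falling back to a single unknown-word vector.
rng = np.random.default_rng(0)

d_init, d_task, n_train = 50, 50, 1000
X = rng.normal(size=(n_train, d_init))          # initial embeddings of training words
W_true = rng.normal(size=(d_init, d_task))      # synthetic ground-truth relation
Y = X @ W_true + 0.01 * rng.normal(size=(n_train, d_task))  # task-trained embeddings

# Fit the mapping by least squares: W = argmin ||X W - Y||^2
W, *_ = np.linalg.lstsq(X, Y, rcond=None)

# A test-set word unseen during supervised training is projected into the
# task-specific space via the learned map.
x_unseen = rng.normal(size=(1, d_init))
e_task = x_unseen @ W
print(e_task.shape)
```

In the paper the linear map would be replaced by a trained neural network, but the input/output contract is the same: initial embedding in, task-specific embedding out.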

Cited by 9 publications (2 citation statements) · References 24 publications
“…In all our methods, words not available in the GLoVe set are randomly initialized in the range ±0.05, indicating the lack of semantic information. By not mapping these words to a single random embedding, we mitigate against the errors that may arise due to their conflation (Madhyastha et al, 2015). A special OOV (out of vocabulary) token is also initialized in the same range.…”
Section: Classifying Content
confidence: 99%
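The initialization scheme described in the excerpt above can be sketched as follows. This is a minimal illustration assuming NumPy and a stand-in embedding table; the names, dimension, and the tiny two-word "GloVe" dictionary are hypothetical:

```python
import numpy as np

# Sketch of the cited scheme: words missing from the pre-trained (GloVe-style)
# vocabulary get a uniform random vector in [-0.05, 0.05], signalling a lack
# of semantic information; a dedicated OOV token is initialized the same way.
rng = np.random.default_rng(0)
dim = 300
glove = {"the": rng.normal(size=dim), "cat": rng.normal(size=dim)}  # stand-in table

def embedding_for(word, table=glove, dim=dim, rng=rng):
    if word in table:
        return table[word]
    # Each unseen word gets its OWN random vector rather than a shared one,
    # which avoids conflating distinct unknown words into a single embedding.
    return rng.uniform(-0.05, 0.05, size=dim)

oov_token = rng.uniform(-0.05, 0.05, size=dim)  # special OOV embedding

v1 = embedding_for("zxqv")
v2 = embedding_for("qqqz")
print(np.abs(v1).max() <= 0.05, np.allclose(v1, v2))
```

The key design point the citation makes is the per-word draw: two distinct unseen words receive different random vectors, so the model does not treat them as the same token.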

Author Profiling for Hate Speech Detection

Mishra,
Del Tredici,
Yannakoudakis
et al. 2019
Preprint
“…A tangential but noteworthy approach considers relations that are not curated in large graphs, but rather corpora annotated for inter-word relations such as syntactic dependencies (Madhyastha et al., 2016). Their system creates a mapping between a distributionally-obtained embedding table and one trained on the annotated parses, and generalizes this mapping to words which are now out-of-vocabulary for a further downstream task (e.g., sentiment analysis).…”
confidence: 99%