Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
DOI: 10.18653/v1/w18-5449
Representation of Word Meaning in the Intermediate Projection Layer of a Neural Language Model

Abstract: Performance in language modelling has been significantly improved by training recurrent neural networks on large corpora. This progress has come at the cost of interpretability and an understanding of how these architectures function, making principled development of better language models more difficult. We look inside a state-of-the-art neural language model to analyse how this model represents high-level lexico-semantic information. In particular, we investigate how the model represents words by extracting …

Cited by 1 publication (2 citation statements)
References 14 publications
“…Several studies investigate the relation between semantic features recorded in feature norm datasets (McRae et al., 2005; Devereux et al., 2014; Vinson and Vigliocco, 2008; Vigliocco et al., 2004) and embedding vectors (Fagarasan et al., 2015; Tsvetkov et al., 2015, 2016; Herbelot and Vecchi, 2015; Herbelot, 2013; Riordan and Jones, 2011; Glenberg and Robertson, 2000; Derby et al., 2018; Forbes et al., 2019; Rubinstein et al., 2015). These studies indicate that (at least partial) mappings between distributional and conceptual spaces are possible and that conceptual knowledge can complement distributional representations.…”
Section: Related Work
Confidence: 99%
“…For instance, in the CSLB norms (Devereux et al., 2014), has legs is listed for several birds, but not for owl, duck, and eagle. This introduces noise to already rather small datasets used to investigate property knowledge in distributional data (Derby et al., 2018). Yaghoobzadeh et al. (2019) apply diagnostic classification to investigate semantic classes using a large, automatically generated dataset derived from Wikipedia, which is likely to contain noise.…”
Section: Related Work
Confidence: 99%