Deriving Boolean structures from distributional                     vectors

Kruszewski, GermÃ¡n; Paperno, Denis; Baroni, Marco

doi:10.1162/tacl_a_00145

Cited by 38 publications

(38 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The semi-supervised model of Kruszewski et al (2015) also models entailment in a vector space, but they use a discrete vector space. They train a mapping from distributional semantic vectors to Boolean vectors such that feature inclusion respects a training set of entailment relations.…”

Section: Related Workmentioning

confidence: 99%

A Vector Space for Distributional Semantics for Entailment

Henderson

Popa

2016

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

View full text Add to dashboard Cite

Distributional semantics creates vectorspace representations that capture many forms of semantic similarity, but their relation to semantic entailment has been less clear. We propose a vector-space model which provides a formal foundation for a distributional semantics of entailment. Using a mean-field approximation, we develop approximate inference procedures and entailment operators over vectors of probabilities of features being known (versus unknown). We use this framework to reinterpret an existing distributionalsemantic model (Word2Vec) as approximating an entailment-based model of the distributions of words in contexts, thereby predicting lexical entailment relations. In both unsupervised and semi-supervised experiments on hyponymy detection, we get substantial improvements over previous results. *

show abstract

Section: Related Workmentioning

confidence: 99%

A Vector Space for Distributional Semantics for Entailment

Henderson

Popa

2016

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

View full text Add to dashboard Cite

show abstract

“…Indeed, showed that two existing lexical entailment models fail to account for similarity between the antecedent and consequent, and conclude that such models are only learning to predict prototypicality: that is, they predict that cat entails animal because animal is usually entailed, and therefore will also predict that sofa entails animal. Yet it remains unclear why such models make for such strong baselines (Weeds et al, 2014;Kruszewski et al, 2015;.…”

Section: Introductionmentioning

confidence: 99%

“…Research on lexical entailment using distributional semantics has now spanned more than a decade, and has been approached using both unsupervised (Weeds et al, 2004;Kotlerman et al, 2010;Lenci and Benotto, 2012;Santus, 2013) and supervised techniques (Baroni et al, 2012;Fu et al, 2014;Roller et al, 2014;Weeds et al, 2014;Kruszewski et al, 2015;Turney and Mohammad, 2015;Santus et al, 2016). Most of the work in unsupervised methods is based on the Distributional Inclusion Hypothesis (Weeds et al, 2004;Zhitomirsky-Geffet and Dagan, 2005), which states that the contexts in which a hypernym appear should be a superset over its hyponyms' contexts.…”

Section: Introductionmentioning

confidence: 99%

“…That is, when the training and test sets were carefully constructed to ensure they were completely disjoint, it performed extremely poorly. Nonetheless, Concat is continually used as a strong baseline in more recent work (Kruszewski et al, 2015).…”

Section: Introductionmentioning

confidence: 99%

“…Recently, other works have begun to analyze Concat and Diff for their ability to go beyond just hypernymy detection. Vylomova et al (2016) take an extensive look at Diff's ability to model a wide variety of lexical relations and conclude it is generally robust, and Kruszewski et al (2015) have success with a neural network model based on the Distributional Inclusion Hypothesis.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Relations such as Hypernymy: Identifying and Exploiting Hearst Patterns in Distributional Vectors for Lexical Entailment

Roller¹,

Erk²

2016

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing

View full text Add to dashboard Cite

We consider the task of predicting lexical entailment using distributional vectors. We perform a novel qualitative analysis of one existing model which was previously shown to only measure the prototypicality of word pairs. We find that the model strongly learns to identify hypernyms using Hearst patterns, which are well known to be predictive of lexical relations. We present a novel model which exploits this behavior as a method of feature extraction in an iterative procedure similar to Principal Component Analysis. Our model combines the extracted features with the strengths of other proposed models in the literature, and matches or outperforms prior work on multiple data sets.

show abstract

Variations on Abstract Semantic Spaces

Erk

2020

The Philosophy and Science of Language

View full text Add to dashboard Cite

Deriving Boolean structures from distributional vectors

Cited by 38 publications

References 25 publications

A Vector Space for Distributional Semantics for Entailment

A Vector Space for Distributional Semantics for Entailment

Relations such as Hypernymy: Identifying and Exploiting Hearst Patterns in Distributional Vectors for Lexical Entailment

Variations on Abstract Semantic Spaces

Contact Info

Product

Resources

About