Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019)
DOI: 10.18653/v1/n19-1023

Evaluating Composition Models for Verb Phrase Elliptical Sentence Embeddings

Abstract: Ellipsis is a natural language phenomenon where part of a sentence is missing and its information must be recovered from the surrounding context, as in "Cats chase dogs and so do foxes." Formal semantics has different methods for resolving ellipsis and recovering the missing information, but the problem has not been considered for distributional semantics, where words have vector embeddings and combinations thereof provide embeddings for sentences. In elliptical sentences these combinations go beyond linear a…
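To make the composition problem concrete, here is a minimal sketch, not the paper's actual model: it assumes small random vectors standing in for pre-trained word embeddings, composes phrases additively, and resolves the ellipsis by copying the verb-phrase vector into the elided conjunct, one way of going beyond purely linear composition as the abstract describes.

```python
import numpy as np

# Hypothetical 4-dimensional word embeddings; a real model would load
# pre-trained vectors (e.g. word2vec or GloVe) instead.
rng = np.random.default_rng(0)
emb = {w: rng.standard_normal(4) for w in ["cats", "chase", "dogs", "foxes"]}

def additive(words):
    # Additive composition: the phrase vector is the sum of its word vectors.
    return np.sum([emb[w] for w in words], axis=0)

# "Cats chase dogs and so do foxes."
# A copying approach resolves the ellipsis first: the verb-phrase vector
# for "chase dogs" is reused for the elided second conjunct.
vp = additive(["chase", "dogs"])
resolved = emb["cats"] + vp + emb["foxes"] + vp
print(resolved)
```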

Cited by 12 publications (20 citation statements)
References 30 publications (33 reference statements)
“…[table of IS, USE, BERTp, and BERTf correlation scores per dataset omitted] …was 0.53, and provide results equal to the state of the art on ELLSIM, which was 0.76, both reported in Wijnholds and Sadrzadeh (2019). However, they are surpassed by fine-tuned BERT sentence embeddings and sentence encoders, which achieve the highest scores.…”
Section: Elliptical Phrase and SICK Datasets
confidence: 64%
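The correlation figures quoted above follow the standard evaluation protocol for such similarity datasets: cosine similarity between model sentence embeddings, correlated with averaged human ratings via Spearman's rho. A minimal sketch of that protocol, where `embed`, `pairs`, and `human_scores` are hypothetical placeholders for a real model and dataset:

```python
import numpy as np
from scipy.stats import spearmanr

def cosine(u, v):
    # Cosine similarity between two sentence embeddings.
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def score_model(embed, pairs, human_scores):
    """Spearman correlation between model and human similarity judgments.

    embed:        function mapping a sentence string to a vector
    pairs:        (sentence1, sentence2) tuples from the dataset
    human_scores: averaged human similarity ratings, one per pair
    """
    model_scores = [cosine(embed(s1), embed(s2)) for s1, s2 in pairs]
    rho, _ = spearmanr(model_scores, human_scores)
    return rho
```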
“…(3,4) The transitive verb disambiguation datasets of Grefenstette and Sadrzadeh (2011) (GS11) and Kartsaklis and Sadrzadeh (2013) (KS13a), and (5) the transitive sentence similarity dataset of Kartsaklis et al. (2013) (KS13b). (6,7) We additionally test on two recent datasets (Wijnholds and Sadrzadeh, 2019) (ELLDIS and ELLSIM), which extend the KS13a and KS13b datasets to sentences containing verb phrase ellipsis.…”
Section: Verb Disambiguation and Sentence Similarity
confidence: 99%
“…The match that our setting provides for human disambiguation judgements is derived solely from observed co-occurrences between words and syntactic roles in a corpus, without any specification of content intrinsic to the word itself. Further experiments will be needed to extend this approach to larger datasets and to dialogue data and to examine its effectiveness, perhaps building on the work extending DS grammars to dialogue (Eshghi et al. 2017), and possibly evaluating on the similarity dataset of Wijnholds and Sadrzadeh (2019), which extends the transitive sentence datasets used in this paper to a verb phrase elliptical setting.…”
Section: Discussion
confidence: 99%
“…Alternatively, one can build vectors for nouns and tensors for adjectives and verbs (and all other words with functional types) and use tensor contraction to build a vector for the sentence (Grefenstette and Sadrzadeh 2015; Kartsaklis and Sadrzadeh 2013). It has been shown that some of the tensor-based models improve on the results of the additive model when considering the whole sentence (Grefenstette and Sadrzadeh 2015; Kartsaklis and Sadrzadeh 2013; Wijnholds and Sadrzadeh 2019); here, we focus on incremental composition as described above to investigate how the disambiguation process works word by word.…”
Section: A Disambiguation Task
confidence: 99%
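The tensor contraction the quotation describes can be sketched for a transitive sentence: nouns are vectors, a transitive verb is an order-3 tensor, and contracting the verb with its subject and object yields the sentence vector. The dimensions and random tensors below are illustrative assumptions, not values from any of the cited models:

```python
import numpy as np

d = 4  # embedding dimension (illustrative)
rng = np.random.default_rng(1)

# Nouns live in a vector space; a transitive verb is an order-3 tensor
# mapping a subject and an object to a sentence vector.
subj = rng.standard_normal(d)            # e.g. "cats"
obj = rng.standard_normal(d)             # e.g. "dogs"
verb = rng.standard_normal((d, d, d))    # e.g. "chase"

# Tensor contraction: sentence_j = sum over i,k of subj_i * verb_ijk * obj_k
sentence = np.einsum("i,ijk,k->j", subj, verb, obj)
print(sentence.shape)  # (4,)
```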
“…We use four disambiguation data sets to evaluate our models. Three of the four data sets, GS2011 (Grefenstette and Sadrzadeh, 2011a), GS2012, and KS2013-CoNLL (Kartsaklis et al., 2013), are publicly available, while ML2008 (Mitchell and Lapata, 2008) was obtained privately from the authors of Wijnholds and Sadrzadeh (2019). We show examples and statistics of the data sets in Table 1.…”
Section: Data Sets
confidence: 99%