2017
DOI: 10.1007/s10994-017-5634-8
The mechanism of additive composition

Abstract: Additive composition (Foltz et al. in Discourse Process 15:285-307, 1998; Landauer and Dumais in Psychol Rev 104(2):211, 1997; Mitchell and Lapata in Cognit Sci 34(8):1388-1429, 2010) is a widely used method for computing meanings of phrases, which takes the average of the vector representations of the constituent words. In this article, we prove an upper bound for the bias of additive composition, which is the first theoretical analysis of compositional frameworks from a machine learning point of view. The bound…
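To make the averaging step described in the abstract concrete, here is a minimal Python sketch. The vocabulary and the three-dimensional vectors are hypothetical placeholders; a real system would load pre-trained embeddings instead.

```python
import numpy as np

# Hypothetical pre-trained word vectors (toy values for illustration).
word_vectors = {
    "machine":  np.array([0.2, 0.7, -0.1]),
    "learning": np.array([0.5, -0.3, 0.4]),
}

def compose_additive(words, vectors):
    """Represent a phrase as the average of its constituent word vectors."""
    return np.mean([vectors[w] for w in words], axis=0)

phrase_vec = compose_additive(["machine", "learning"], word_vectors)
print(phrase_vec)  # -> [0.35 0.2  0.15]
```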

Cited by 18 publications (15 citation statements)
References 47 publications (74 reference statements)
“…First, choose a word w. Then, for each window s containing w, take the average of the vectors of the words in s and denote it as v_s. Now, take the average of v_s for all the windows s containing w, and denote the average as u. Theorem 1 says that u can be mapped to the word vector v_w by a linear transformation that does not depend on w. This linear structure may also have connections to some other phenomena related to linearity, e.g., Gittens et al. (2017) and Tian et al. (2017). Exploring such connections is left for future work.…”
Section: Gaussian Walk Model (mentioning)
confidence: 99%
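The construction quoted above is mechanical enough to sketch in code. The following Python is a hypothetical illustration of computing u for a target word; the corpus, window half-width, and vectors are assumptions, and the word-independent linear map of Theorem 1 is not shown.

```python
import numpy as np

def window_average(tokens, target, vectors, half_width=2):
    """Compute u: the average, over all windows s containing `target`,
    of the per-window average vector v_s (the construction quoted above).
    Assumes `target` occurs in `tokens` and every token has a vector."""
    window_vecs = []
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo, hi = max(0, i - half_width), min(len(tokens), i + half_width + 1)
        # v_s: average of the vectors of the words in window s
        v_s = np.mean([vectors[w] for w in tokens[lo:hi]], axis=0)
        window_vecs.append(v_s)
    return np.mean(window_vecs, axis=0)  # u
```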
“…However, previous model designs mostly rely on linguistic intuitions (Paperno et al., 2014, inter alia), whereas our model has an exact logical interpretation. Furthermore, by using additive composition we enjoy a learning guarantee (Tian et al., 2015).…”
Section: Sentence Completion (mentioning)
confidence: 99%
“…The rationale for our model is as follows. First, recent research has shown that additive composition of word vectors is an approximation to the situation where two words have overlapping context (Tian et al., 2015); therefore, it is suitable for implementing an "and" or intersection operation (Section 3). We design our model so that the resulting distributional representations are expected to have additive compositionality.…”
Section: Introduction (mentioning)
confidence: 99%
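To make the "intersection" reading concrete, here is a small hypothetical sketch: the averaged vector of two words is used as a query, and the nearest neighbour among the remaining vocabulary is retrieved. The retrieval step and cosine similarity are illustrative assumptions, not the cited paper's exact procedure.

```python
import numpy as np

def cosine(a, b):
    return float(a @ b) / (np.linalg.norm(a) * np.linalg.norm(b))

def intersect(w1, w2, vectors):
    """Treat the average of two word vectors as their 'and'/intersection,
    then retrieve the closest remaining word as a lexical paraphrase."""
    query = (vectors[w1] + vectors[w2]) / 2.0  # additive composition
    others = {w: v for w, v in vectors.items() if w not in (w1, w2)}
    return max(others, key=lambda w: cosine(query, others[w]))
```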
“…This inequality reflects the difference between the importance of the constituents (i.e., the word embeddings). Following Tian et al. (2017), the coefficients α, β are scalars drawn from a monotonic function. In this work, we consider that a reasonable choice for such a monotonic function is Shannon's entropy (Shannon, 1949; Charniak, 1996; Aizawa, 2003).…”
Section: Composition in Distributional Semantics (mentioning)
confidence: 99%
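A minimal sketch of such an entropy-weighted additive composition, under stated assumptions: the context-count tables below are toy placeholders for corpus co-occurrence counts, and normalizing the coefficients to sum to one is a design choice of this illustration, not necessarily the cited paper's.

```python
import numpy as np

def shannon_entropy(counts):
    """Shannon entropy (in bits) of a context-count distribution."""
    p = np.asarray(counts, dtype=float)
    p = p / p.sum()
    p = p[p > 0]  # drop zero counts; 0 * log(0) is taken as 0
    return float(-(p * np.log2(p)).sum())

def compose_weighted(v1, v2, counts1, counts2):
    """Weighted additive composition alpha*v1 + beta*v2, with alpha, beta
    derived from the entropy of each word's context distribution."""
    alpha, beta = shannon_entropy(counts1), shannon_entropy(counts2)
    total = alpha + beta
    return (alpha / total) * v1 + (beta / total) * v2
```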