Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2016
DOI: 10.18653/v1/n16-1162
Learning Distributed Representations of Sentences from Unlabelled Data

Abstract: Unsupervised methods for learning distributed representations of words are ubiquitous in today's NLP research, but far less is known about the best ways to learn distributed phrase or sentence representations from unlabelled data. This paper is a systematic comparison of models that learn such representations. We find that the optimal approach depends critically on the intended application. Deeper, more complex models are preferable for representations to be used in supervised systems, but shallow log-linear models …
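The abstract contrasts deeper sequence encoders with shallow log-linear composition. As a rough illustration of the shallow end of that spectrum (not a reproduction of any specific model from the paper), the sketch below builds a sentence vector by averaging pre-trained word vectors; the dictionary `word_vecs` and the toy vocabulary are illustrative assumptions.

```python
import numpy as np

def embed_sentence(tokens, word_vecs, dim=300):
    """Shallow, order-insensitive sentence representation:
    the mean of the pre-trained vectors of the in-vocabulary words."""
    vecs = [word_vecs[t] for t in tokens if t in word_vecs]
    if not vecs:  # no known words: fall back to a zero vector
        return np.zeros(dim)
    return np.mean(vecs, axis=0)

# Toy random vectors stand in for real pre-trained embeddings.
rng = np.random.default_rng(0)
word_vecs = {w: rng.normal(size=300) for w in ["the", "cat", "sat", "on", "mat"]}
sent_vec = embed_sentence("the cat sat on the mat".split(), word_vecs)
print(sent_vec.shape)  # (300,)
```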

Cited by 413 publications (510 citation statements)
References 27 publications
“…The simplest Average model achieves competitive results while the most complex LSTM model does not show advantages. [Table 1 excerpt: correlation coefficients of model predictions with subject similarity ratings on the Chinese sentence similarity task; bold marks the best among models with the same composition function. … (Le and Mikolov, 2014): 0.7561; FastSent (Hill et al., 2016): 0.7369; Char-CNN (Kim et al., 2016): 0.8095; Charagram (Wieting et al., 2016a): …]…”
Section: Results (mentioning)
confidence: 99%
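The quoted excerpt evaluates models by the Pearson correlation between predicted sentence similarities and human subject ratings. A minimal sketch of that evaluation step, with made-up numbers standing in for real model outputs and ratings:

```python
from scipy.stats import pearsonr

# Hypothetical values: one predicted similarity and one human rating per sentence pair.
model_scores = [0.82, 0.34, 0.91, 0.15, 0.67]
human_ratings = [4.5, 1.8, 4.9, 0.9, 3.6]

r, p_value = pearsonr(model_scores, human_ratings)
print(f"Pearson r = {r:.4f} (p = {p_value:.3g})")
```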
“…A better continuous space vector representation of the messages might improve SD2 and SP2. Much research has been conducted recently on obtaining better continuous space vector representations of sentences (Le and Mikolov, 2014; Kiros et al., 2015; Hill et al., 2016) instead of centroid vectors. Another direction for future work would be to investigate replacing the SVM classifiers by multilayer perceptrons, possibly on top of recurrent neural nets that would compute vector representations of sentences.…”
Section: Discussion (mentioning)
confidence: 99%
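The cited discussion represents each message as the centroid of its word vectors and feeds that vector to an SVM classifier, with multilayer perceptrons mentioned as a possible replacement. A hedged sketch of the centroid-plus-SVM pipeline using scikit-learn; `centroid_vector`, `word_vecs`, and the toy data are illustrative, not taken from the cited work:

```python
import numpy as np
from sklearn.svm import SVC

def centroid_vector(tokens, word_vecs, dim=300):
    """Centroid (mean) of the word vectors of a message."""
    vecs = [word_vecs[t] for t in tokens if t in word_vecs]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

# Toy random vectors and two labelled messages, in place of real data.
rng = np.random.default_rng(1)
word_vecs = {w: rng.normal(size=300) for w in ["good", "bad", "movie", "film"]}
X = np.stack([
    centroid_vector("good movie".split(), word_vecs),
    centroid_vector("bad film".split(), word_vecs),
])
y = [1, 0]

clf = SVC(kernel="linear").fit(X, y)  # SVM on centroid features
print(clf.predict(X))
```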
“…Lastly, we note other recent work that considers a similar transfer learning setting. The FastSent model (Hill et al., 2016) uses the 2014 STS task in its evaluation and reports an average Pearson's r of 61.3. On the same data, the C-PHRASE model (Pham et al., 2015) has an average Pearson's r of 65.7.…”
Section: Sentence Embedding Experiments (mentioning)
confidence: 99%
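The STS comparison above scores each sentence pair by the similarity of the two sentence embeddings and then correlates those scores with gold judgements, as in the Pearson sketch earlier. A small sketch of the pair-scoring step (cosine similarity); the vectors here are random placeholders, not outputs of FastSent or C-PHRASE:

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine similarity between two sentence vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Placeholder vectors; in the quoted evaluation each would come from a sentence encoder.
rng = np.random.default_rng(2)
vec_a, vec_b = rng.normal(size=300), rng.normal(size=300)
print(f"pair score: {cosine_similarity(vec_a, vec_b):.3f}")
```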