Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/d15-1278

When Are Tree Structures Necessary for Deep Learning of Representations?

Abstract: Recursive neural models, which use syntactic parse trees to recursively generate representations bottom-up, are a popular architecture. But there have not been rigorous evaluations showing for exactly which tasks this syntax-based method is appropriate. In this paper we benchmark recursive neural models against sequential recurrent neural models (simple recurrent and LSTM models), enforcing apples-to-apples comparison as much as possible. We investigate 4 tasks: (1) sentiment classification at the sentence level…
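To make the contrast in the abstract concrete, the sketch below shows the two model families being compared: a sequential LSTM that reads the sentence left to right, and a recursive model that composes a representation bottom-up along a binary parse tree. This is a minimal illustrative sketch, not the authors' released code; the `Tree` class, the `compose` layer, and the dimension `DIM` are assumptions made for the example.

```python
# Minimal sketch (assumed, not the paper's implementation) of the two
# model families compared in the paper.
import torch
import torch.nn as nn

DIM = 100  # hypothetical embedding/hidden size


class Tree:
    """A binary parse-tree node: either a leaf word index or two children."""
    def __init__(self, word=None, left=None, right=None):
        self.word, self.left, self.right = word, left, right


class SequentialLSTM(nn.Module):
    """Reads the word sequence left to right; the final hidden state is the sentence vector."""
    def __init__(self, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, DIM)
        self.lstm = nn.LSTM(DIM, DIM, batch_first=True)

    def forward(self, word_ids):              # word_ids: (1, seq_len)
        _, (h, _) = self.lstm(self.embed(word_ids))
        return h[-1]                          # (1, DIM)


class RecursiveNN(nn.Module):
    """Composes child representations bottom-up along the parse tree."""
    def __init__(self, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, DIM)
        self.compose = nn.Linear(2 * DIM, DIM)

    def forward(self, node):
        if node.word is not None:             # leaf: look up the word vector
            return self.embed(torch.tensor([node.word]))
        left = self.forward(node.left)        # recurse into both children
        right = self.forward(node.right)
        return torch.tanh(self.compose(torch.cat([left, right], dim=-1)))
```

In either case, a task-specific classifier (e.g., a softmax layer for sentence-level sentiment) would sit on top of the resulting vector; the paper's question is when the tree-structured composition is worth the extra machinery over the sequential encoder.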

Cited by 156 publications (124 citation statements)
References 25 publications

Citation statements
“…This has allowed for the simplified creation of large labeled datasets, ideal for the application of unsupervised learning methods. Approaches have continued to evolve, both in terms of problem formulation (Chen et al., 2015) and as the full weight of modern machine learning techniques have been brought to bear (Socher et al., 2013; Tai et al., 2015; Li et al., 2015).…”
Section: Study 1: Sentiment Analysis (mentioning)
confidence: 99%
“…Neural network classifiers are popular for relation extraction recently. Many of them focus on fully supervised settings, recurrent neural networks (RNN) and convolutional neural networks (CNN) (Vu et al., 2016; Zeng et al., 2015; Xu et al., 2015a; Xu et al., 2015b; Zhang and Wang, 2015), sequence models and tree models are investigated (Li et al., 2015; dos Santos et al., 2015). One similar network structure to our model is proposed in (Miwa and Bansal, 2016).…”
Section: Related Work (mentioning)
confidence: 99%
“…Compared with previous neural models, we keep the advantage of convolutional neural network (Nguyen and Grishman, 2015) in capturing local contexts. Besides, we also incorporate a Bi-directional LSTM to model the preceding and following information of a word as it has been commonly accepted that LSTM is good at capturing long-term dependencies in a sequence (Tang et al., 2015b; Li et al., 2015a).…”
Section: Related Work (mentioning)
confidence: 99%
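The hybrid design described in the quote above, a convolutional layer for local word-window contexts combined with a bidirectional LSTM for preceding and following context, can be sketched roughly as follows. This is an illustrative assumption, not the cited authors' implementation; the class name, dimensions, and kernel size are made up for the example.

```python
# Hedged sketch of a CNN + BiLSTM encoder of the kind described in the quote.
import torch
import torch.nn as nn


class ConvBiLSTMEncoder(nn.Module):
    def __init__(self, vocab_size, emb_dim=100, hidden=100, kernel=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # Conv1d over the time axis captures local windows of `kernel` words.
        self.conv = nn.Conv1d(emb_dim, hidden, kernel, padding=kernel // 2)
        # The bidirectional LSTM captures left and right context of each word.
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True,
                              bidirectional=True)

    def forward(self, word_ids):                        # (batch, seq_len)
        x = self.embed(word_ids)                        # (batch, seq_len, emb_dim)
        local = torch.relu(self.conv(x.transpose(1, 2))).transpose(1, 2)
        context, _ = self.bilstm(x)                     # (batch, seq_len, 2*hidden)
        return torch.cat([local, context], dim=-1)      # per-word features
```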