Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/d15-1262

Comparing Word Representations for Implicit Discourse Relation Classification

Abstract: This paper presents a detailed comparative framework for assessing the usefulness of unsupervised word representations for identifying so-called implicit discourse relations. Specifically, we compare standard one-hot word pair representations against low-dimensional ones based on Brown clusters and word embeddings. We also consider various word vector combination schemes for deriving discourse segment representations from word vectors, and compare representations based either on all words or limited to head words…
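As a rough illustration of the two feature families the abstract contrasts, here is a minimal sketch comparing sparse one-hot word-pair features with dense segment vectors built by combining word embeddings. The vocabulary, the embedding size, and the `combine` options are assumptions for illustration, not the paper's exact configuration.

```python
# Sketch: one-hot word-pair features vs. dense segment vectors.
# All names and sizes here are illustrative assumptions.
import numpy as np

vocab = {"because": 0, "rain": 1, "wet": 2, "ground": 3}
V = len(vocab)

def one_hot_word_pairs(arg1_tokens, arg2_tokens):
    """Sparse V*V indicator vector: one cell per (w1, w2) pair."""
    feats = np.zeros(V * V)
    for w1 in arg1_tokens:
        for w2 in arg2_tokens:
            feats[vocab[w1] * V + vocab[w2]] = 1.0
    return feats

# Dense alternative: a small random table stands in for pretrained
# embeddings (e.g., word2vec vectors) or Brown-cluster indicators.
rng = np.random.default_rng(0)
E = rng.normal(size=(V, 50))  # 50-dim embeddings (assumed size)

def segment_vector(tokens, combine="mean"):
    """Combine word vectors into one discourse segment representation."""
    vecs = E[[vocab[t] for t in tokens]]
    return vecs.mean(axis=0) if combine == "mean" else vecs.sum(axis=0)

a1 = segment_vector(["rain"])           # Arg1
a2 = segment_vector(["ground", "wet"])  # Arg2
pair_feats = np.concatenate([a1, a2])   # 100 dims vs. V*V sparse cells
```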

Cited by 54 publications (51 citation statements). References 20 publications.
“…With the release of PDTB 2.0 (Prasad et al., 2008), much work has been done on discourse relation identification over natural (i.e., genuine) discourse data (Pitler et al., 2009; Lin et al., 2009; Wang et al., 2010; Zhou et al., 2010; Braud and Denis, 2015; Fisher and Simmons, 2015), using traditional, linguistically informed NLP features and machine learning algorithms. More recently, researchers have increasingly turned to neural networks for implicit discourse recognition (Zhang et al., 2015; Qin et al., 2016a; Liu and Li, 2016; Braud and Denis, 2016; Wu et al., 2016).…”
Section: Introduction (mentioning)
Confidence: 99%
“…Max pooling is known to be very effective in vision, but it is unclear which pooling function works well when pooling word vectors. Summation pooling and mean pooling have been claimed to perform well at composing the meaning of a short phrase from individual word vectors (Le and Mikolov, 2014; Blacoe and Lapata, 2012; Mikolov et al., 2013b; Braud and Denis, 2015). The Arg1 vector $a_1$ and the Arg2 vector $a_2$ are computed by applying an element-wise pooling function $f$ to all $N_1$ word vectors in Arg1, $w^1_{1:N_1}$, and all $N_2$ word vectors in Arg2, $w^2_{1:N_2}$, respectively: $a_1 = f(w^1_{1:N_1})$ and $a_2 = f(w^2_{1:N_2})$.…”
Section: Bag-of-Words Feedforward Model (mentioning)
Confidence: 99%
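To make the pooling scheme in the statement above concrete, here is a minimal sketch of element-wise sum, mean, and max pooling over an argument's word vectors. The array shapes and the `pool` helper are hypothetical, not taken from the cited model.

```python
# Sketch: the same element-wise pooling function f maps the N word
# vectors of each argument to a single fixed-size vector.
import numpy as np

def pool(word_vectors, f="max"):
    """word_vectors: (N, d) array -> (d,) argument vector."""
    if f == "max":
        return word_vectors.max(axis=0)   # element-wise max pooling
    if f == "mean":
        return word_vectors.mean(axis=0)  # mean pooling
    return word_vectors.sum(axis=0)       # summation pooling

rng = np.random.default_rng(0)
w1 = rng.normal(size=(7, 300))  # N1 = 7 word vectors for Arg1 (assumed)
w2 = rng.normal(size=(5, 300))  # N2 = 5 word vectors for Arg2 (assumed)

a1 = pool(w1, f="max")  # Arg1 vector a_1 = f(w^1_{1:N_1})
a2 = pool(w2, f="max")  # Arg2 vector a_2 = f(w^2_{1:N_2})
```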
“…Recently, distributed word representations (Bengio et al., 2003; Mikolov et al., 2013) have shown an advantage in dealing with the data sparsity problem (Braud and Denis, 2015). Many deep learning methods have proved helpful for discourse relation parsing and have achieved significant progress.…”
Section: Background on Discourse Relation (mentioning)
Confidence: 99%
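As a back-of-the-envelope illustration of the sparsity point made above, the sketch below contrasts the size of a one-hot word-pair feature space with a dense embedding-based one. The vocabulary size and embedding dimension are assumptions chosen only for illustration.

```python
# Sketch: one-hot word-pair features live in a |V|^2 space where most
# pairs are never observed in training, while dense embeddings keep
# related words close, so unseen pairs still get informative features.
V = 40_000  # assumed vocabulary size
d = 300     # assumed embedding dimension

one_hot_pair_dims = V * V  # 1.6e9 possible word-pair features
dense_pair_dims = 2 * d    # concatenated Arg1/Arg2 vectors: 600

print(f"one-hot pair space: {one_hot_pair_dims:,} dims")
print(f"dense pair space:   {dense_pair_dims} dims")
```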