Multi-task Attention-based Neural Networks for Implicit Discourse Relationship Representation and Identification

Lan, Man; Wang, Jianxiang; Wu, Yuanbin; Niu, Zheng-Yu; Wang, Haifeng

doi:10.18653/v1/d17-1134

Cited by 95 publications

(85 citation statements)

References 18 publications


“…Early studies (Pitler et al, 2008; Lin et al, 2009, 2014) focused on extracting linguistic and semantic features from two discourse units. Recent research (Zhang et al, 2015; Ji and Eisenstein, 2015; Ji et al, 2016) tried to model compositional meanings of two discourse units by exploiting interactions between words in two units with more and more complicated neural network models, including the ones using neural tensor (Chen et al, 2016; Qin et al, 2016; Lei et al, 2017) and attention mechanisms (Lan et al, 2017). Another trend is to alleviate the shortage of annotated data by leveraging related external data, such as explicit discourse relations in PDTB (Lan et al, 2017; Qin et al, 2017) and unlabeled data obtained elsewhere (Lan et al, 2017), often in a multi-task joint learning framework.…”

Section: Implicit Discourse Relation Recognition

mentioning

confidence: 99%

Improving Implicit Discourse Relation Classification by Modeling Inter-dependencies of Discourse Units in a Paragraph

Dai¹,

Huang²

2018

Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: Hu


We argue that the semantic meaning of a sentence or clause cannot be interpreted independently from the rest of a paragraph, or independently from all discourse relations and the overall paragraph-level discourse structure. With the goal of improving implicit discourse relation classification, we introduce a paragraph-level neural network that models inter-dependencies between discourse units as well as discourse relation continuity and patterns, and predicts a sequence of discourse relations in a paragraph. Experimental results show that our model outperforms the previous state-of-the-art systems on the benchmark corpus of PDTB.


“…Qin et al 2017; Lan et al 2017; Dai and Huang 2018; Lei et al 2018), our model still achieves F1 improvements of 1.53% on Comp. and 7.2% on Temp., the numbers of samples belonging to which are the two least in all classes as shown in Table 2.…”

mentioning

confidence: 70%

Shallow Convolutional Neural Network for Implicit Discourse Relation Recognition

Zhang¹,

Su²,

Xiong³

et al. 2015

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing


Implicit discourse relation recognition remains a serious challenge due to the absence of discourse connectives. In this paper, we propose a Shallow Convolutional Neural Network (SCNN) for implicit discourse relation recognition, which contains only one hidden layer but is effective in relation recognition. The shallow structure alleviates the overfitting problem, while the convolution and nonlinear operations help preserve the recognition and generalization ability of our model. Experiments on the benchmark data set show that our model achieves comparable and even better performance compared with current state-of-the-art systems.


“…Recently, neural networks have shown an advantage in dealing with the data sparsity problem, and many deep learning methods have been proposed for discourse parsing, including convolutional (Zhang et al, 2015), recurrent (Ji et al, 2016), character-based (Qin et al, 2016a), and adversarial (Qin et al, 2017) neural networks, as well as pair-aware neural sentence modeling (Cai and Zhao, 2017). Multi-task learning has also been shown to be beneficial for this task (Lan et al, 2017).…”

Section: Implicit Discourse Relation Classification

mentioning

confidence: 99%

“…h_e and h_d are then combined using a linear layer (Lan et al, 2017). As illustrated in Equation 11, the linear layer acts as a gate to determine how much information from the sequence-to-sequence network should be mixed into the original sentence's representations from the encoder.…”

Section: Gated Interaction

mentioning

confidence: 99%
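
The gated combination described in the statement above can be illustrated with a small sketch. Equation 11 of the citing paper is not reproduced on this page, so the gate form used here is an assumption: a sigmoid over a linear layer applied to the concatenation of the two vectors, followed by an element-wise blend. The function name `gated_mix` and the parameters `W` and `b` are hypothetical, chosen only for illustration.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_mix(h_e, h_d, W, b):
    """Blend an encoder vector h_e with a seq2seq vector h_d through a gate.

    Assumed form (NOT taken from the cited paper's Equation 11):
        g = sigmoid(W [h_e; h_d] + b)
        h = g * h_e + (1 - g) * h_d
    W has one row per output dimension and 2 * len(h_e) columns.
    """
    x = h_e + h_d  # list concatenation, i.e. [h_e; h_d]
    mixed = []
    for i, (e, d) in enumerate(zip(h_e, h_d)):
        # The linear layer yields one gate value per dimension.
        g = sigmoid(sum(w * v for w, v in zip(W[i], x)) + b[i])
        # g near 1 keeps the encoder value, g near 0 keeps the seq2seq value.
        mixed.append(g * e + (1.0 - g) * d)
    return mixed


if __name__ == "__main__":
    h_e = [1.0, 2.0]
    h_d = [3.0, 4.0]
    W = [[0.0] * 4, [0.0] * 4]  # zero weights give a gate of exactly 0.5
    b = [0.0, 0.0]
    print(gated_mix(h_e, h_d, W, b))
```

With zero weights and zero bias the gate is 0.5 in every dimension, so the result is the plain average of the two vectors; in a trained model `W` and `b` would be learned so that the gate varies per dimension and per input.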

Learning to Explicitate Connectives with Seq2Seq Network for Implicit Discourse Relation Classification

Wei¹,

Demberg²

2019

Proceedings of the 13th International Conference on Computational Semantics - Long Papers


Implicit discourse relation classification is one of the most difficult steps in discourse parsing. The difficulty stems from the fact that the coherence relation must be inferred based on the content of the discourse relational arguments. Therefore, an effective encoding of the relational arguments is of crucial importance. We here propose a new model for implicit discourse relation classification, which consists of a classifier, and a sequence-to-sequence model which is trained to generate a representation of the discourse relational arguments by trying to predict the relational arguments including a suitable implicit connective. Training is possible because such implicit connectives have been annotated as part of the PDTB corpus. Along with a memory network, our model could generate more refined representations for the task. And on the now standard 11-way classification, our method outperforms the previous state of the art systems on the PDTB benchmark on multiple settings including cross validation.
