Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence 2020
DOI: 10.24963/ijcai.2020/530

On the Importance of Word and Sentence Representation Learning in Implicit Discourse Relation Classification

Abstract: Implicit discourse relation classification is one of the most difficult parts of shallow discourse parsing, as predicting the relation without explicit connectives requires language understanding at both the text span level and the sentence level. Previous studies mainly focus on the interactions between the two arguments. We argue that a powerful contextualized representation module, a bilateral multi-perspective matching module, and a global information fusion module are all important to implicit discourse relation classification. […]
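The abstract names three components (contextualized representation, bilateral multi-perspective matching, global information fusion). Below is a minimal sketch of how such a pipeline might compose, not the authors' released code: the encoder stub, the bilinear interaction stand-in, the fusion layer, and all dimensions are assumptions for illustration.

```python
import torch
import torch.nn as nn

class ImplicitRelationClassifier(nn.Module):
    """Sketch of the three-module pipeline: encode -> match -> fuse -> classify."""
    def __init__(self, dim=768, num_labels=11):
        super().__init__()
        # 1) Contextualized representation module (the paper uses a pretrained
        #    encoder; stubbed here with a linear projection over pooled vectors).
        self.encoder = nn.Linear(dim, dim)
        # 2) Bilateral multi-perspective matching module (stand-in: a single
        #    bilinear interaction between the two arguments).
        self.match = nn.Bilinear(dim, dim, dim)
        # 3) Global information fusion module (stand-in: projection over the
        #    concatenated argument and interaction features).
        self.fuse = nn.Linear(3 * dim, dim)
        self.classifier = nn.Linear(dim, num_labels)

    def forward(self, arg1, arg2):
        # arg1, arg2: pooled argument vectors of shape [batch, dim]
        h1, h2 = self.encoder(arg1), self.encoder(arg2)
        m = self.match(h1, h2)                              # arg1 <-> arg2 interaction
        fused = torch.relu(self.fuse(torch.cat([h1, h2, m], dim=-1)))
        return self.classifier(fused)                       # [batch, num_labels]

model = ImplicitRelationClassifier()
logits = model(torch.randn(4, 768), torch.randn(4, 768))   # -> [4, 11]
```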

Cited by 34 publications (35 citation statements). References 7 publications.
“…Second, in most cases, jointly inferring multi-level labels (HierMTN-CRF-RoBERTa, OurEncoder+OurDecoder, LDSGM) performs better than predicting them separately as in BMGF-RoBERTa (Table 2: comparison with recent models on the consistency among multi-level label predictions). We ran the code of BMGF-RoBERTa (Liu et al. 2020) and report the results.…”
Section: Results
confidence: 99%
“…Therefore, we report the label-wise F1 scores for the second-level labels in Table 4. A closer look into the results reveals that, though our LDSGM model outperforms BMGF-RoBERTa (Liu et al. 2020) on most majority labels, the F1 scores for three minority labels are still 0%. The same holds for the BERT-based model (Kishimoto, Murawaki, and Kurohashi 2020): such small numbers of training examples are insufficient to optimize the huge number of parameters in these models.…”
Section: Performance on Minority Label Predictions
confidence: 94%
“…For the classification subtasks, we use the BiMPM [31] multi-perspective symmetric matching model, originally proposed for sentence matching and used to extract implicit interactions between text spans [17]. In the BiMPM model, the two textual inputs are encoded with a BiLSTM encoder and the resulting vectors are matched in both directions at each time step.…”
Section: Models for Unlabeled Tree Construction and Discourse Relation Classification
confidence: 99%
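The excerpt above describes BiMPM encoding both spans with a BiLSTM and matching them in both directions at each time step. The sketch below illustrates only the "full matching" strategy with a single perspective (cosine similarity against the other span's final state); the class name, dimensions, and pooling choices are illustrative assumptions rather than the BiMPM reference implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyBiMatch(nn.Module):
    """Bidirectional full-matching between two spans, single perspective."""
    def __init__(self, emb_dim=100, hidden=64):
        super().__init__()
        # Shared BiLSTM context encoder for both text spans.
        self.encoder = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)

    def forward(self, p, q):
        # p: [B, Tp, emb_dim], q: [B, Tq, emb_dim]
        hp, _ = self.encoder(p)                 # [B, Tp, 2*hidden]
        hq, _ = self.encoder(q)                 # [B, Tq, 2*hidden]
        # Match each time step of one span against the other span's final
        # hidden state, in both directions.
        q_last = hq[:, -1, :].unsqueeze(1)      # [B, 1, 2*hidden]
        p_last = hp[:, -1, :].unsqueeze(1)      # [B, 1, 2*hidden]
        m_p = F.cosine_similarity(hp, q_last, dim=-1)   # [B, Tp]
        m_q = F.cosine_similarity(hq, p_last, dim=-1)   # [B, Tq]
        return m_p, m_q

match = TinyBiMatch()
mp, mq = match(torch.randn(2, 7, 100), torch.randn(2, 9, 100))
```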
“…As the claim representations $\{\overleftarrow{H}_i\}$ and $\{\overrightarrow{H}_i\}$ from RoBERTa are not bidirectional, we need to combine them and control which of them matters more. Gated fusion (Liu et al., 2020) has been shown to be a better mixture than the combination of multi-head attention and layer normalization. We use it to retain the powerful representative features and carry useful historical context information:…”
Section: Bidirectional Representation Fusion
confidence: 99%
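The excerpt above applies the gated fusion of Liu et al. (2020) to combine forward and backward representations. Below is a minimal sketch of one common formulation of such a gate; the variable names, the sigmoid-gate form, and the dimension are assumptions for illustration, not the cited implementation.

```python
import torch
import torch.nn as nn

class GatedFusion(nn.Module):
    """Mix two one-directional representations with a learned per-dimension gate."""
    def __init__(self, dim):
        super().__init__()
        self.w_gate = nn.Linear(2 * dim, dim)

    def forward(self, h_fwd, h_bwd):
        # The gate decides, per dimension, how much each direction contributes.
        g = torch.sigmoid(self.w_gate(torch.cat([h_fwd, h_bwd], dim=-1)))
        return g * h_fwd + (1.0 - g) * h_bwd

fuse = GatedFusion(dim=768)
h_forward, h_backward = torch.randn(8, 768), torch.randn(8, 768)
h = fuse(h_forward, h_backward)   # -> [8, 768]
```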