Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1226
Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension

Abstract: Machine reading comprehension (MRC) is a crucial and challenging task in NLP. Recently, pre-trained language models (LMs), especially BERT, have achieved remarkable success, presenting new state-of-the-art results in MRC. In this work, we investigate the potential of leveraging external knowledge bases (KBs) to further improve BERT for MRC. We introduce KT-NET, which employs an attention mechanism to adaptively select desired knowledge from KBs, and then fuses selected knowledge with BERT to enable context- and knowledge-aware predictions…
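The attention-and-fusion step described in the abstract can be sketched in a few lines. The module below is only an illustrative reading of that description, not the authors' implementation: the class name `KnowledgeFusion`, the tensor shapes, the single projection layer, the sentinel vector, and the final concatenation are all assumptions made for this PyTorch sketch.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class KnowledgeFusion(nn.Module):
    """Attention-based selection of KB concept embeddings, fused with BERT states.

    A minimal sketch of the idea in the abstract; shapes, the sentinel vector,
    and the concatenation are assumptions, not the authors' exact formulation.
    """

    def __init__(self, bert_dim: int = 768, kb_dim: int = 100):
        super().__init__()
        self.proj = nn.Linear(bert_dim, kb_dim, bias=False)  # score BERT states against concepts
        self.sentinel = nn.Parameter(torch.zeros(kb_dim))    # "no relevant knowledge" option

    def forward(self, bert_states, kb_embeddings, kb_mask):
        # bert_states:   (B, T, bert_dim)   contextual token representations from BERT
        # kb_embeddings: (B, T, C, kb_dim)  candidate concept embeddings retrieved per token
        # kb_mask:       (B, T, C)          1 for real candidates, 0 for padding
        B, T, C, K = kb_embeddings.shape

        # Append the sentinel as an extra candidate so a token may attend to "nothing".
        sentinel = self.sentinel.view(1, 1, 1, K).expand(B, T, 1, K)
        candidates = torch.cat([kb_embeddings, sentinel], dim=2)       # (B, T, C+1, K)
        mask = torch.cat([kb_mask, kb_mask.new_ones(B, T, 1)], dim=2)  # (B, T, C+1)

        # Attention scores between each token and its candidate concepts.
        query = self.proj(bert_states).unsqueeze(2)                    # (B, T, 1, K)
        scores = (query * candidates).sum(-1)                          # (B, T, C+1)
        scores = scores.masked_fill(mask == 0, float("-inf"))
        alpha = F.softmax(scores, dim=-1)                              # adaptive selection

        # Weighted sum of selected knowledge, concatenated with the BERT state.
        knowledge = torch.einsum("btc,btck->btk", alpha, candidates)   # (B, T, K)
        return torch.cat([bert_states, knowledge], dim=-1)             # (B, T, bert_dim + kb_dim)
```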

Cited by 122 publications (67 citation statements)
References 27 publications
“…Table 4 shows EM (%) and F1 (%) of human performance, the PSH-SJTU system, as well as baselines on the development and test sets of task 2. Compared with the best baseline, KT-NET (Yang et al., 2019a), PSH-SJTU achieves significantly better scores. On the hidden test set, they improve EM by 10.08% and F1 by 8.98%.…”
Section: Participants (mentioning)
confidence: 94%
“…KT-NET (Yang et al., 2019a) employs an attention mechanism to adaptively select desired knowledge from knowledge bases, and then fuses selected knowledge with BERT to enable context- and knowledge-aware predictions for machine reading comprehension. […] (Seo et al., 2016) and self-attention, both of which are widely used in MC models.…”
Section: Task 2 Baselines (mentioning)
confidence: 99%
“…The first one, mutual attention, is aimed at fusing the question representations into the passage so as to obtain the question-aware passage representations; the second one, self-attention, is aimed at fusing the question-aware passage representations into themselves so as to obtain the final passage representations. Yang et al. [35] proposed KT-NET, which employs an attention mechanism to adaptively select desired knowledge from KBs, and then fuses selected knowledge with BERT to enable context- and knowledge-aware predictions. However, most previous methods retrieve the relevant knowledge base before encoding it into the MRC model, which can only refer to the KB locally.…”
Section: Knowledge Fusion (mentioning)
confidence: 99%
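The two-stage fusion in the quoted passage (mutual attention, then self-attention) can be summarised schematically. This sketch makes simplifying assumptions not stated in the quote: dot-product scoring and concatenation as the fusion operator, and the function names are invented for illustration; the cited systems use richer scoring functions and gating.

```python
import torch
import torch.nn.functional as F

def mutual_attention(passage, question):
    # passage:  (B, Tp, d)  passage token representations
    # question: (B, Tq, d)  question token representations
    # Each passage token attends over the question; the attended summary is
    # concatenated with the original state -> question-aware passage representation.
    scores = torch.bmm(passage, question.transpose(1, 2))   # (B, Tp, Tq)
    alpha = F.softmax(scores, dim=-1)
    attended = torch.bmm(alpha, question)                   # (B, Tp, d)
    return torch.cat([passage, attended], dim=-1)           # (B, Tp, 2d)

def self_attention(states):
    # states: (B, Tp, d')  question-aware passage representations
    # Each position attends over the whole sequence, fusing it "into itself".
    scores = torch.bmm(states, states.transpose(1, 2))      # (B, Tp, Tp)
    alpha = F.softmax(scores, dim=-1)
    attended = torch.bmm(alpha, states)
    return torch.cat([states, attended], dim=-1)            # final passage representations
```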
“…This has led many to leverage knowledge graphs (KGs) (Mihaylov and Frank, 2018; Lin et al., 2019; Yang et al., 2019). KGs represent relational knowledge between entities with multi-relational edges for models to acquire.…”
Section: Introduction (mentioning)
confidence: 99%
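As a concrete picture of the “multi-relational edges” mentioned in the last quote: such a KG is commonly stored as (head entity, relation, tail entity) triples, so a pair of entities can be linked by edges carrying different relation labels. A toy example follows; the entities and relations are made up and not taken from any of the cited KGs.

```python
from collections import defaultdict

# A knowledge graph as (head, relation, tail) triples: edges are labelled with
# relation types, so two entities may be connected by several different relations.
triples = [
    ("dog", "hypernym", "animal"),
    ("dog", "has_part", "tail"),
    ("animal", "hypernym", "organism"),
]

# Adjacency view grouped by (entity, relation), handy for neighbourhood lookups.
neighbours = defaultdict(list)
for head, relation, tail in triples:
    neighbours[(head, relation)].append(tail)

print(neighbours[("dog", "hypernym")])   # ['animal']
```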