DocRED: A Large-Scale Document-Level Relation Extraction Dataset

Yao, Yuan; Ye, Deming; Li, Peng; Han, Xu; Lin, Yankai; Liu, Zhenghao; Liu, Zhiyuan; Huang, Lixin; Zhou, Jie; Sun, Maosong

doi:10.18653/v1/p19-1074

Cited by 294 publications

(390 citation statements)

References 32 publications

(30 reference statements)

Supporting

Mentioning

385

Contrasting

Unclassified

Order By: Relevance

“…For document-level RE, the input is a document with annotated entities, as well as multiple occurrences of each entity, i.e., entity mentions, the goal is to identify all the related entity pairs in the document. Following [15], we transform RE into a classification problem. We use upper case letters to represent entities (E 1 , · · · , E m ) and lower case letters to represent mentions (e 1 , · · · , e m ).…”

Section: Task Descriptionmentioning

confidence: 99%

“…To evaluate the effectiveness of our model, we use the DocRED dataset [15], which is the largest human-annotated document-level RE dataset constructed from Wikidata and Wikipedia. DocRED contains over 5,053 documents, 40,276 sentences, 132,375 entities and 96 frequent relation types.…”

Section: Datasetmentioning

confidence: 99%

“…We compare our model against the following document-level RE baselines: CNN/LSTM/BiLSTM-RE: They first encode a document into a hidden state vector sequence with CNN/LSTM/BiLSTM as encoder, and then predict relations for each entity pair by feeding them into a bilinear function [15]. Context-Aware: It uses an LSTM-based encoder to jointly learn representations for all relations in the context, and then combines other context relations with target relation to make the final prediction [12].…”

Section: Comparison Models and Evaluation Metricsmentioning

confidence: 99%

“…Recently, there has been increasing interest in document-level RE. Yao et al [15] proposed a large-scale human-annotated document-level RE dataset, DocRED, and first compute the representations for all entities then predict relations for each entity pair by feeding them into a bilinear function. Wang et al [13] used BERT to encode the document, it also used bilinear layer to predict the relation between entity pairs, but it modelled the document-level RE through a two-step process.…”

Section: Related Workmentioning

confidence: 99%

“…Obviously, most traditional sentence-level RE models often fail to generalize extraction to this situation. To move RE forward from sentence level to document level, many efforts have been made [13,15], but most previous methods used only entity-level information and this is not adequate. Thus, there are still some deep-seated problems unsolved in document-level RE.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

HIN: Hierarchical Inference Network for Document-Level Relation Extraction

Tang

Cao

Zhang

et al. 2020

Advances in Knowledge Discovery and Data Mining

View full text Add to dashboard Cite

Document-level RE requires reading, inferring and aggregating over multiple sentences. From our point of view, it is necessary for document-level RE to take advantage of multi-granularity inference information: entity level, sentence level and document level. Thus, how to obtain and aggregate the inference information with different granularity is challenging for document-level RE, which has not been considered by previous work. In this paper, we propose a Hierarchical Inference Network (HIN) to make full use of the abundant information from entity level, sentence level and document level. Translation constraint and bilinear transformation are applied to target entity pair in multiple subspaces to get entity-level inference information. Next, we model the inference between entity-level information and sentence representation to achieve sentence-level inference information. Finally, a hierarchical aggregation approach is adopted to obtain the document-level inference information. In this way, our model can effectively aggregate inference information from these three different granularities. Experimental results show that our method achieves state-of-the-art performance on the largescale DocRED dataset. We also demonstrate that using BERT representations can further substantially boost the performance.

show abstract

Section: Task Descriptionmentioning

confidence: 99%

Section: Datasetmentioning

confidence: 99%