Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of Events

Ballesteros, Miguel; Anubhai, Rishita; Wang, Shuai; Pourdamghani, Nima; Vyas, Yogarshi; Ma, Jie; Bhatia, Parminder; McKeown, Kathleen; Al-Onaizan, Yaser

doi:10.18653/v1/2020.emnlp-main.436

Cited by 17 publications

(17 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Table 3 compares our work to the baseline methods reported on the TDDMan, TDDAuto, MATRES, and TimeBank-Dense datasets. We also include results for BERT-based Transformer (Devlin et al, 2019) and RoBERTa (Liu et al, 2019) following Ballesteros et al (2020). To prevent truncation or memory errors otherwise caused by multi-sentence spans, we concatenate only sentences containing source and events as input to Transformer baselines.…”

Section: Resultsmentioning

confidence: 99%

“…Prior work focuses on extracting temporal relations between event pairs (a.k.a., TLINKS) present in the same sentence (Intra-sentence TLINKS) or adjacent sentences (Inter-sentence TLINKS), mostly ignoring document-level pairs (Crossdocument TLINKS) (Reimers et al, 2016). Past works have used RNN (Cheng and Miyao, 2017;Meng et al, 2017;Goyal and Durrett, 2019;Ning et al, 2019;Han et al, 2019aHan et al, ,c,b, 2020b and Transformer networks (Ballesteros et al, 2020;Zhao et al, 2020b) for encoding a few sentences or a short paragraph but do not capture longrange dependencies and multi-hop reasoning at the document-level. This shortcoming is shown in the TDDiscourse dataset (Naik et al, 2019), which was designed to highlight global discourse-level challenges, e.g., multi-hop chain reasoning, future or hypothetical events, and reasoning requiring world knowledge.…”

Section: Introductionmentioning

confidence: 99%

“…Results comparing performance of TIMERS with baselines and ablative components on TDDMan, TDDAuto, MATRES and TimeBank-Dense datasets. We adopt the BERT and RoBERTa implementation from(Ballesteros et al, 2020). * indicates statistical significance over BERT Transformer (p ≤ 0.005) under Wilcoxon's Signed Rank test.…”

mentioning

confidence: 99%

See 2 more Smart Citations

TIMERS: Document-level Temporal Relation Extraction

Mathur¹,

Jain²,

Dernoncourt³

et al. 2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

View full text Add to dashboard Cite

We present TIMERS -a TIME, Rhetorical and Syntactic-aware model for document-level temporal relation classification. Our proposed method leverages rhetorical discourse features and temporal arguments from semantic role labels, in addition to traditional local syntactic features, trained through a Gated Relational-GCN. Extensive experiments show that the proposed model outperforms previous methods by 5-18% on the TDDiscourse, TimeBank-Dense, and MATRES datasets due to our discourse-level modeling.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

TIMERS: Document-level Temporal Relation Extraction

Mathur¹,

Jain²,

Dernoncourt³

et al. 2021

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer

View full text Add to dashboard Cite

show abstract

“…We note that the problem of temporal graph extraction is different from the more popular task of Temporal relation extraction (Temprel), which deals with classifying the temporal link between two already extracted events. State of the art Temprel systems use neural methods (Ballesteros et al, 2020;Ning et al, 2019b;Goyal and Durrett, 2019;Han et al, 2019;Cheng and Miyao, 2017), but typically use a handful of documents for their development and evaluation. Vashishtha et al (2019) are a notable exception by using Amazon Mechanical Turks to obtain manual annotations over a larger dataset of 16,000 sentences.…”

Section: Temporal Relation Extractionmentioning

confidence: 99%

Neural Language Modeling for Contextualized Temporal Graph Generation

Madaan¹,

Yang²

2021

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua

View full text Add to dashboard Cite

This paper presents the first study on using large-scale pre-trained language models for automated generation of an event-level temporal graph for a document. Despite the huge success of neural pre-training methods in NLP tasks, its potential for temporal reasoning over event graphs has not been sufficiently explored. Part of the reason is the difficulty in obtaining large training corpora with humanannotated events and temporal links. We address this challenge by using existing IE/NLP tools to automatically generate a large quantity (89,000) of system-produced document-graph pairs, and propose a novel formulation of the contextualized graph generation problem as a sequence-to-sequence mapping task. These strategies enable us to leverage and fine-tune pre-trained language models on the systeminduced training data for the graph generation task. Our experiments show that our approach is highly effective in generating structurally and semantically valid graphs. Further, evaluation on a challenging hand-labeled, out-ofdomain corpus shows that our method outperforms the closest existing method by a large margin on several metrics. We also show a downstream application of our approach by adapting it to answer open-ended temporal questions in a reading comprehension setting. 1

show abstract

“…The tasks are binary or multiple classification problems. Note the dataset of MATRES is split at the article level as in the previous work[15].2. MATRES[18] is a pairwise event temporal ordering prediction dataset, where each event pair in one document is annotated with a temporal relation (Before, After, Equal, Vague).…”

mentioning

confidence: 99%

CoCoLM: COmplex COmmonsense Enhanced Language Model with Discourse Relations

Yu¹,

Zhang²,

Song³

et al. 2020

Preprint

View full text Add to dashboard Cite

Large-scale pre-trained language models have demonstrated strong knowledge representation ability. However, recent studies suggest that even though these giant models contains rich simple commonsense knowledge (e.g., bird can fly and fish can swim.), they often struggle with the complex commonsense knowledge that involves multiple eventualities (verb-centric phrases, e.g., identifying the relationship between "Jim yells at Bob" and "Bob is upset"). To address this problem, in this paper, we propose to help pre-trained language models better incorporate complex commonsense knowledge. Different from existing fine-tuning approaches, we do not focus on a specific task and propose a general language model named CoCoLM. Through the careful training over a large-scale eventuality knowledge graphs ASER, we successfully teach pre-trained language models (i.e., BERT and RoBERTa) rich complex commonsense knowledge among eventualities. Experiments on multiple downstream commonsense tasks that requires the correct understanding of eventualities demonstrate the effectiveness of CoCoLM.

show abstract

Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of Events

Cited by 17 publications

References 20 publications

TIMERS: Document-level Temporal Relation Extraction

TIMERS: Document-level Temporal Relation Extraction

Neural Language Modeling for Contextualized Temporal Graph Generation

CoCoLM: COmplex COmmonsense Enhanced Language Model with Discourse Relations

Contact Info

Product

Resources

About