Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: Hu 2018
DOI: 10.18653/v1/n18-2058
|View full text |Cite
|
Sign up to set email alerts
|

Semi-Supervised Event Extraction with Paraphrase Clusters

Abstract: Supervised event extraction systems are limited in their accuracy due to the lack of available training data. We present a method for self-training event extraction systems by bootstrapping additional training data. This is done by taking advantage of the occurrence of multiple mentions of the same event instances across newswire articles from multiple sources. If our system can make a highconfidence extraction of some mentions in such a cluster, it can then acquire diverse training examples by adding the othe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(20 citation statements)
references
References 9 publications
(9 reference statements)
0
20
0
Order By: Relevance
“…They can extract high-quality event mentions for the given types, but cannot extract mentions for any new types. Recent studies Chan et al, 2019;Ferguson et al, 2018) leverage annotations for a few seen event types or several keywords provided for the new types to extract mentions for new types. However, all these studies assume all the target types are given, which is very costly when moving to a new scenario.…”
Section: Related Workmentioning
confidence: 99%
“…They can extract high-quality event mentions for the given types, but cannot extract mentions for any new types. Recent studies Chan et al, 2019;Ferguson et al, 2018) leverage annotations for a few seen event types or several keywords provided for the new types to extract mentions for new types. However, all these studies assume all the target types are given, which is very costly when moving to a new scenario.…”
Section: Related Workmentioning
confidence: 99%
“…and Zeng et al (2018) use distant supervision to generate largescale data from existing structured event knowledge in knowledge bases. Liao and Grishman (2010a), Huang and Riloff (2012a) and Ferguson et al (2018) conduct semi-supervised ED with bootstrapping. Nevertheless, due to the low coverage of existing knowledge bases as well as lack of advanced denoising mechanism, those weakly supervised methods still suffer from the problem of low coverage and noisy data.…”
Section: Related Workmentioning
confidence: 99%
“…As events have complicated structures, like different event types containing different arguments with different roles, computing event extraction confidence is normally with low accuracies. Liao and Grishman [209], [210] proposed to use only a part of extraction results from new samples for data expansion, in particular, the most confident trigger with its most confident argument. Based on their extraction model [94], the most confident ''<role, trigger>'' pairs are selected based on the product between probability from their trigger classifier and argument classifier.…”
Section: A Joint Data Expansion and Model Trainingmentioning
confidence: 99%