Event extraction from multimodal documents is an important yet under-explored problem. A key challenge for this task is the scarcity of paired image-text data, which makes it difficult to fully exploit the strong representation power of multimodal language models. In this paper, we present Theia, an end-to-end multimodal event extraction framework that can be trained on incomplete data. Specifically, we couple a generation-based event extraction model with a customised image synthesizer that generates images from text. Our model leverages the capabilities of pre-trained vision-language models and can be trained on incomplete (i.e., text-only) data. Experimental results on existing multimodal datasets demonstrate that our approach outperforms state-of-the-art methods in both synthesising missing data and extracting events.
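To illustrate the idea of coupling an extraction model with an image synthesizer so that text-only examples can still be used for training, the sketch below shows one possible training step under simplified assumptions. The module names (`ImageSynthesizer`, `EventExtractor`), feature dimensions, and the classification-style loss are hypothetical stand-ins for illustration only; they are not the paper's actual architecture or API.

```python
# Minimal sketch: fill in the missing image modality with a synthesizer
# so that incomplete (text-only) examples can still train the extractor.
import torch
import torch.nn as nn

class ImageSynthesizer(nn.Module):
    """Stand-in for a text-conditioned image generator (hypothetical)."""
    def __init__(self, text_dim=768, image_dim=1024):
        super().__init__()
        self.proj = nn.Linear(text_dim, image_dim)

    def forward(self, text_emb):
        # Produce a synthetic image representation from the text embedding.
        return self.proj(text_emb)

class EventExtractor(nn.Module):
    """Stand-in for a multimodal event extraction head (hypothetical)."""
    def __init__(self, text_dim=768, image_dim=1024, num_event_types=34):
        super().__init__()
        self.classifier = nn.Linear(text_dim + image_dim, num_event_types)

    def forward(self, text_emb, image_emb):
        # Fuse the two modalities and predict an event type.
        return self.classifier(torch.cat([text_emb, image_emb], dim=-1))

def training_step(batch, synthesizer, extractor):
    """One step over a batch that may lack gold images (text-only examples)."""
    text_emb = batch["text_emb"]          # (B, 768) pre-computed text features
    image_emb = batch.get("image_emb")    # (B, 1024) or None when images are missing
    if image_emb is None:
        # Incomplete data: substitute synthetic image features for the missing modality.
        image_emb = synthesizer(text_emb)
    logits = extractor(text_emb, image_emb)
    return nn.functional.cross_entropy(logits, batch["event_labels"])
```

In this simplified view, the synthesizer only needs text as input, so every text-only example still yields a (text, image) pair at training time, which is what allows the multimodal extractor to be trained end-to-end on incomplete data.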