Background Event extraction is essential for natural language processing. In the biomedical field, the nested event phenomenon (event A as a participating role of event B) makes extracting this event more difficult than extracting a single event. Therefore, the performance of nested biomedical events is always underwhelming. In addition, previous works relied on a pipeline to build an event extraction model, which ignored the dependence between trigger recognition and event argument detection tasks and produced significant cascading errors. Objective This study aims to design a unified framework to jointly train biomedical event triggers and arguments and improve the performance of extracting nested biomedical events. Methods We proposed an end-to-end joint extraction model that considers the probability distribution of triggers to alleviate cascading errors. Moreover, we integrated the syntactic structure into an attention-based gate graph convolutional network to capture potential interrelations between triggers and related entities, which improved the performance of extracting nested biomedical events. Results The experimental results demonstrated that our proposed method achieved the best F1 score on the multilevel event extraction biomedical event extraction corpus and achieved a favorable performance on the biomedical natural language processing shared task 2011 Genia event corpus. Conclusions Our conditional probability joint extraction model is good at extracting nested biomedical events because of the joint extraction mechanism and the syntax graph structure. Moreover, as our model did not rely on external knowledge and specific feature engineering, it had a particular generalization performance.
BACKGROUND Extracting events is essential in natural language processing. In the biomedical field, the nested event phenomenon (event A is a participating role of event B) makes extracting this event more difficult than extracting a single event. Therefore, the performance of nested biomedical events is always underwhelming. In addition, previous works rely on a pipeline to build an event extraction model, which ignores the dependence between trigger recognition and event argument detection tasks and produces significant cascading errors. OBJECTIVE We aim to design a unified framework to train biomedical event triggers and arguments jointly, and improve the performance of extracting nested biomedical events. METHODS We proposed an end-to-end joint extraction model that considers the probability distribution of triggers to alleviate the cascading errors. Moreover, we integrate the syntactic structure into an attention-based gate GCN to capture potential interrelations between triggers and related entities, which improves the performance of extracting nested biomedical events. RESULTS The experimental results demonstrate that our proposed method achieves the best F1-score on the MLEE biomedical event extraction corpus and achieves a favorable performance on the BioNLP-ST 2011 GE corpus. CONCLUSIONS Our CPJE model is good at extracting nested biomedical events because of the joint extraction mechanism and the syntax graph structure. Moreover, because our model does not rely on external knowledge and specific feature engineering, it has a particular generalization performance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.