Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/2021.emnlp-main.94

Zero-Shot Information Extraction as a Unified Text-to-Triple Translation

Abstract: We cast a suite of information extraction tasks into a text-to-triple translation framework. Instead of solving each task relying on task-specific datasets and models, we formalize the task as a translation between task-specific input text and output triples. By taking the task-specific input, we enable a task-agnostic translation by leveraging the latent knowledge that a pre-trained language model has about the task. We further demonstrate that a simple pre-training task of predicting which relational information…
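To make the framing above concrete, here is a minimal sketch of the text-to-triple setup; the input/output schemas are assumptions for illustration, not the paper's actual data format.

# Illustrative sketch of the text-to-triple framing (schemas are assumed,
# not taken from the paper): each IE task supplies task-specific input,
# and the model translates it into (head, relation, tail) triples.

SENTENCE = "Barack Obama was born in Hawaii."

# Open information extraction: the input is just the sentence.
oie_input = {"text": SENTENCE}

# Relation extraction: the input adds a task-specific signal, the entity pair.
re_input = {"text": SENTENCE, "entities": ("Barack Obama", "Hawaii")}

# Both tasks share one output space: a list of triples.
expected_triples = [("Barack Obama", "was born in", "Hawaii")]

for task, source in (("OIE", oie_input), ("RE", re_input)):
    print(f"{task}: {source} -> {expected_triples}")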

Cited by 17 publications (8 citation statements). References 23 publications.
Citation types: 0 supporting, 8 mentioning, 0 contrasting.

“…Furthermore, we compare our system with zero-shot task-specific approaches from other authors when available. For RE, Wang et al (2021a) propose a text-to-triple translation method that, given a text and a set of entities, returns the existing relations. For EE, Lyu et al (2021) propose, similarly to us, the use of an entailment model, but in their case the input sentence is split into clauses according to the output of a Semantic Role Labelling system.…”
Section: Methods (citation type: mentioning; confidence: 99%)
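For contrast, a rough sketch of the entailment-based route this snippet describes, using an off-the-shelf NLI model via the Hugging Face zero-shot pipeline; the model name and label set are assumptions for illustration, and the SRL-based clause splitting is omitted.

# Rough sketch of entailment-based zero-shot event typing: each candidate
# event type becomes a textual hypothesis scored by an NLI model. Model
# name and label set are assumed, not taken from Lyu et al (2021).
from transformers import pipeline

nli = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

clause = "The company acquired its rival for $2 billion."
event_types = ["acquisition", "protest", "election"]  # hypothetical label set

result = nli(clause, candidate_labels=event_types,
             hypothesis_template="This text describes a {} event.")
print(result["labels"][0], result["scores"][0])  # top-entailment event type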
“…A few works [13,22,23] have explored using LMs as-is or performing prompt engineering, which consists of finding the most appropriate prompt to solve a given task. Liu et al [13] survey these methods for Natural Language Processing tasks.…”
Section: Language Models and In-context Learning (citation type: mentioning; confidence: 99%)
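A toy illustration of that setting: with an LM used as-is, the engineering effort reduces to choosing a prompt template. The template below is invented for illustration, not taken from the cited works.

# Invented prompt template for zero-shot relation extraction with an LM
# used as-is; no task-specific training is involved.
TEMPLATE = ("Text: {text}\n"
            "Question: What is the relation between {head} and {tail}?\n"
            "Answer:")

prompt = TEMPLATE.format(text="Barack Obama was born in Hawaii.",
                         head="Barack Obama", tail="Hawaii")
print(prompt)  # fed unchanged to a pre-trained language model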
“…Accounting for early approaches in the literature, Yates et al [66] proposed the first Open IE system by using a self-supervised learning approach; Fader et al [67] leveraged POS tag patterns; Del Corro and Gemulla [68] decomposed a sentence into clauses; and Stanovsky et al [69] created the first annotated corpus by an automatic translation from the Question-Answer Meaning Representation dataset and developed an Open IE system using a Bi-LSTM with a BIO tagging scheme. More recently, Ro et al [70] included two classifiers for predicates and arguments: they use the hidden states of a BERT model to extract predicates, and then the concatenation of the predicate average, the BERT hidden sequence, and position embeddings is used as input to multi-head attention blocks for argument extraction. Wang et al [71] proposed a text-to-triple translation framework that includes generating and ranking steps; it uses beam search over BERT attention scores to generate relevant triples and then ranks the generated results using a contrastive pre-trained model.…”
Section: Relation Extraction (citation type: mentioning; confidence: 99%)
“…On the other hand, relation discovery aims at discovering unseen relation types from unsupervised data; e.g., [72] is a recent work in the literature that casts the task of relation discovery as a clustering task.…”
Section: Relation Extraction (citation type: mentioning; confidence: 99%)
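A simplified sketch of the generate step that the snippet above attributes to Wang et al [71]: it ranks the tokens between two entities by the BERT attention they receive from the entity tokens and keeps the top ones as a candidate relation phrase. This is a crude proxy (greedy ranking rather than the paper's beam search over attention scores), the contrastive ranking model is omitted, and entity mentions are assumed to be single tokens.

# Proxy for attention-guided triple generation: score in-between tokens by
# the attention they receive from the two entity tokens. Not the authors'
# implementation; a greedy stand-in for their beam search.
import torch
from transformers import BertModel, BertTokenizerFast

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased", output_attentions=True)
model.eval()

def candidate_relation(text: str, head: str, tail: str, k: int = 2):
    """Pick the k tokens between head and tail that receive the most
    attention from the entity tokens, as a proxy relation phrase."""
    enc = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        out = model(**enc)
    # out.attentions is a tuple of (batch, heads, seq, seq) tensors, one
    # per layer; average over layers and heads to one (seq, seq) matrix.
    att = torch.stack(out.attentions).mean(dim=(0, 2))[0]
    tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
    h, t = tokens.index(head), tokens.index(tail)  # single-token entities
    lo, hi = min(h, t) + 1, max(h, t)
    # Rank in-between tokens by attention received from both entity tokens.
    top = sorted(range(lo, hi),
                 key=lambda i: (att[h, i] + att[t, i]).item(),
                 reverse=True)[:k]
    relation = " ".join(tokens[i] for i in sorted(top))  # sentence order
    return (head, relation, tail)

print(candidate_relation("obama was born in hawaii .", "obama", "hawaii"))
# e.g. ('obama', 'born in', 'hawaii')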