Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021
DOI: 10.18653/v1/2021.acl-long.9
Mention Flags (MF): Constraining Transformer-based Text Generators

Abstract: This paper focuses on Seq2Seq (S2S) constrained text generation, where the text generator is constrained to mention specific words, which are inputs to the encoder, in the generated outputs. Pre-trained S2S models such as T5, or a Copy Mechanism, can be trained to copy surface tokens from the encoder to the decoder, but they cannot guarantee constraint satisfaction. Constrained decoding algorithms always produce hypotheses satisfying all constraints; however, they are computationally expensive and can lower the generated text quality…
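To make the idea concrete, here is a minimal Python sketch of how such mention flags could be computed during decoding: each encoder token carries a flag recording whether it is a lexical constraint and, if so, whether the partial hypothesis has mentioned it yet. The function name and the 0/1/2 flag encoding are illustrative assumptions, not the authors' implementation.

# Illustrative sketch of the Mention Flags idea (assumed encoding, not the paper's code).
# 0 = ordinary input token, 1 = constraint not yet mentioned, 2 = constraint mentioned.
def compute_mention_flags(encoder_tokens, constraints, partial_hypothesis):
    """Return one flag per encoder token for the current partial hypothesis."""
    mentioned = set(partial_hypothesis)
    flags = []
    for tok in encoder_tokens:
        if tok not in constraints:
            flags.append(0)   # not a lexical constraint
        elif tok in mentioned:
            flags.append(2)   # constraint already satisfied
        else:
            flags.append(1)   # constraint still pending
    return flags

# Example: the generator must mention "dog" and "ball".
enc = ["a", "dog", "plays", "ball"]
print(compute_mention_flags(enc, {"dog", "ball"}, []))              # [0, 1, 0, 1]
print(compute_mention_flags(enc, {"dog", "ball"}, ["the", "dog"]))  # [0, 2, 0, 1]

In the paper's setting, these flags are embedded and fed to the S2S decoder at each step, so the model itself learns to keep generating until every constraint is marked as mentioned.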

Cited by 11 publications (6 citation statements). References 27 publications.
“…In the future, we would explore extending KG-S2S to other Seq2Seq PLMs, such as BART (Lewis et al., 2020) and MASS (Song et al., 2019). In addition, it is interesting to combine KG-S2S with other knowledge-intensive NLP tasks, such as conversation recommendation (Li et al., 2018b) and commonsense generation (Wang et al., 2021b) in the Seq2Seq framework, and see if the KG knowledge could benefit these downstream tasks.…”
Section: Discussion
confidence: 99%
“…where score(b) is the score of the current beam state, logGen(w) is the output logit of the generator, f(*) are functions that score word w, weighted by α_i, and V_suc is a predefined vocabulary. Similarly, Mention Flags (Wang et al., 2021) tries to identify the presence of tokens in the hypothesis given a set of flags. Both methods face the same problem since they operate on surface tokens.…”
Section: Related Work
confidence: 99%
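The excerpt above glosses an equation that did not survive extraction on this page. A plausible reconstruction from the surviving definitions (an assumption, not necessarily the citing paper's exact formula) is the usual weighted-decoding beam score, in LaTeX:

% Assumed reconstruction: extending beam state b with word w combines the
% running beam score, the generator logit, and weighted feature functions.
\[
  \mathrm{score}(b, w) = \mathrm{score}(b) + \log\mathrm{Gen}(w)
  + \sum_{i} \alpha_i \, f_i(w), \qquad w \in V_{\mathrm{suc}}
\]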
“…These decomposition strategies showed high performance while introducing more detailed annotation to model training [5, 7]. Inspired by the success of pretrained language models and the corresponding natural language generation-based paradigm for various NLP tasks [4], [21-23] tackle event extraction as controlled event generation. [6] is an end-to-end conditional generation method with manually designed discrete prompts for each event type, which needs more human effort to find the…”
Section: Related Work
confidence: 99%