Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
DOI: 10.18653/v1/2021.findings-acl.82
Promoting Graph Awareness in Linearized Graph-to-Text Generation

Abstract: Generating text from structured inputs, such as meaning representations or RDF triples, has often involved the use of specialized graph-encoding neural networks. However, recent applications of pretrained transformers to linearizations of graph inputs have yielded state-of-the-art generation results on graph-to-text tasks. Here, we explore the ability of these linearized models to encode local graph structures, in particular their invariance to the graph linearization strategy and their ability to reconstruct co…

Cited by 14 publications (14 citation statements)
References 28 publications
“…We find that, training only 5.1% task-specific parameters, STRUCTADAPT-RGCN achieves a BLEU score of 46.6 on LDC2017T10, substantially improving over FINE-TUNE and other lightweight baselines (ADAPT, FT-TOP2, FT-BOTTOM2), and outperforming Ribeiro et al. (2020a) and Hoyle et al. (2021), which fine-tune T5 while updating significantly more parameters. STRUCTADAPT also achieves state-of-the-art performance on LDC2020T02, considerably improving over Bevilacqua et al. (2021), which implicitly models the graph structure information using linearization techniques.…”
Section: Results
confidence: 66%
“…Radev et al. (2020) propose DART, a new data-to-text dataset, and train a BART model while gradually augmenting the WebNLG training data with DART data. Hoyle et al. (2021) explore scaffolding objectives in PLMs and show gains in low-resource graph-to-text settings. Different from the above works, we focus on general transfer learning strategies for graph-to-text generation, investigating task-adaptive pretraining approaches and employing additional collected task-specific data for different PLMs (BART and T5) and benchmarks.…”
Section: Related Work
confidence: 99%
“…Ribeiro et al. (2020a) investigate encoder-decoder PLMs for graph-to-text generation, and show that adaptive pretraining can lead to notable improvements and that PLMs benefit much more from the graph structure of AMRs than from that of KGs. Hoyle et al. (2020) explore the extent to which PLMs are invariant to graph linearization, finding that models trained on canonical linearizations fail to generalize to meaning-preserving alternatives. Compared to this line of work, which tunes all PLM parameters, our method obtains a further 19x reduction in task-specific parameters, tuning only 5.1% while maintaining comparable performance.…”
Section: Related Work
confidence: 99%
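The "5.1% of parameters" claim refers to parameter-efficient tuning: the pretrained model's weights are frozen and only small adapter modules are updated. A minimal, hypothetical PyTorch sketch of that idea follows; the "adapter" naming convention and the helper function are illustrative assumptions, not code from STRUCTADAPT or the cited papers.

import torch.nn as nn

def freeze_except_adapters(model: nn.Module, adapter_keyword: str = "adapter") -> float:
    # Freeze every parameter whose name does not contain the keyword,
    # leaving only the (hypothetical) adapter modules trainable.
    total, trainable = 0, 0
    for name, param in model.named_parameters():
        param.requires_grad = adapter_keyword in name
        total += param.numel()
        if param.requires_grad:
            trainable += param.numel()
    return trainable / total  # fraction of trainable parameters

Applied to a PLM augmented with adapter layers, this would report whatever fraction of the parameters the adapter modules account for (on the order of a few percent, depending on adapter size).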
“…We thus are interested in measuring the impact of the graph linearization on the models. Following Hoyle et al. (2020), we explore three different graph linearizations: (i) CANON: the original order of the canonical human-created linearizations in the AMR corpora; (ii) RECONF: the order from the canonical linearization is ignored, except for the top node; and (iii) RANDOM: a linearization constructed from a random node in the graph, disregarding all order information from the canonical format while remaining a valid traversal of the graph. All linearizations are converted to a sequence of node and edge labels using depth-first traversal and used for both training and evaluation.…”
Section: Robustness To Graph Linearization
confidence: 99%
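As a rough illustration of the depth-first linearization these statements describe, the sketch below turns a toy AMR-like graph into a sequence of node and edge labels. The dict-based graph encoding, the function name dfs_linearize, and the toy example are illustrative assumptions, not code from the cited papers; fixing the start at the top node approximates CANON, while picking a random start node approximates RANDOM.

import random

def dfs_linearize(graph, start=None):
    # graph: dict mapping a node label to a list of (edge_label, child) pairs,
    # i.e. a rooted, directed AMR-like graph.
    if start is None:
        start = random.choice(list(graph))  # RANDOM-style: arbitrary start node
    tokens, visited = [], set()

    def visit(node):
        visited.add(node)
        tokens.append(node)
        for edge_label, child in graph.get(node, []):
            tokens.append(edge_label)
            if child in visited:
                tokens.append(child)  # re-entrant node: emit its label only
            else:
                visit(child)

    visit(start)
    return " ".join(tokens)

# Toy graph for "the boy wants to go": (w / want-01 :ARG0 (b / boy) :ARG1 (g / go-01 :ARG0 b))
toy = {
    "want-01": [(":ARG0", "boy"), (":ARG1", "go-01")],
    "go-01": [(":ARG0", "boy")],
    "boy": [],
}
print(dfs_linearize(toy, start="want-01"))  # CANON-like: fixed top node
# -> want-01 :ARG0 boy :ARG1 go-01 :ARG0 boy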