Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.18
Few-Shot NLG with Pre-Trained Language Model

Abstract: Neural-based end-to-end approaches to natural language generation (NLG) from structured data or knowledge are data-hungry, making their adoption for real-world applications difficult with limited data. In this work, we propose the new task of few-shot natural language generation. Motivated by how humans tend to summarize tabular data, we propose a simple yet effective approach and show that it not only demonstrates strong performance but also provides good generalization across domains. The design of the model…

Cited by 76 publications (107 citation statements)
References 20 publications (28 reference statements)

“…Most recently, large-scale pre-trained models (Radford et al., 2019; Song et al., 2019; Raffel et al., 2019) have achieved new state-of-the-art results on various generation tasks. Chen et al. (2019b) demonstrate that a simple pre-training based method can achieve very reasonable performance on the WikiBio dataset (Lebret et al., 2016) under the few-shot setting. More recent works begin to focus on preserving the fidelity of the generation, such as (Dhingra et al., 2019; Tian et al., 2019).…”
Section: Related Work (mentioning)
Confidence: 89%
“…Considering that acquiring a large amount of (logical form, description) pairs in real-world cases is expensive, we also include a few-shot learning task for our dataset, where the model is only provided with hundreds of paired examples. Previous works have shown that pre-trained language models obtain strong NLG performance even with a handful of fine-tuning instances (Chen et al., 2019b). Therefore, we still use the best-performing GPT-2 model for this study.…”
Section: Few-shot Setting (mentioning)
Confidence: 99%
“…In similar studies, pre-trained GPT models (Radford et al., 2019) were used by Chen et al. (2020) and Peng et al. (2020), who fine-tune them on a small set of in-domain data, but they did not distill these models into ones suitable for production. Interestingly, Wen et al. (2016) demonstrated that the structure of arguments in existing dialogues can be used to guide data collection for low-resource domain adaptation, which is similar to the bucketing strategies explored here.…”
Section: Related Work (mentioning)
Confidence: 99%
“…More recently, advances in neural-network-based (conditional) language generation prompted a new direction in NLG research (Novikova et al., 2017; Budzianowski et al., 2018; Chen et al., 2020; Balakrishnan et al., 2019; Peng et al., 2020). The process is typically split into two steps: (1) serialization of the input data into a flattened meaning representation (MR), and (2) using the neural generation model to generate a natural language response conditioned on the MR.…”
Section: Introduction (mentioning)
Confidence: 99%