Proceedings of the 12th International Conference on Natural Language Generation 2019
DOI: 10.18653/v1/w19-8652

Semantic Noise Matters for Neural Natural Language Generation

Abstract: Neural natural language generation (NNLG) systems are known for their pathological outputs, i.e., generating text which is unrelated to the input specification. In this paper, we show the impact of semantic noise on state-of-the-art NNLG models which implement different semantic control mechanisms. We find that cleaned data can improve semantic correctness by up to 97%, while maintaining fluency. We also find that the most common error is omitting information, rather than hallucination.
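
The abstract's distinction between omissions (input information missing from the output) and hallucinations (output content unsupported by the input) is typically measured by slot-level matching against the meaning representation (MR). The sketch below is a minimal, hypothetical illustration of such a check for E2E-style MRs; the function name, the example MR, and the naive verbatim matching are assumptions for illustration, not the authors' actual hand-crafted per-slot patterns.

```python
import re

# Hypothetical slot-level checker for E2E-style MRs. The paper's cleaning
# relies on hand-crafted per-slot patterns; the verbatim matching below
# is a simplifying assumption for illustration only.

def check_slots(mr, text):
    """Split MR slots into realised vs. omitted for a generated utterance.

    mr:   dict of slot -> value, e.g. {"name": "The Eagle"}
    text: the system output to check
    """
    realised, omitted = [], []
    for slot, value in mr.items():
        # Naive verbatim check; real patterns also cover paraphrases
        # ("priceRange=cheap" -> "low-priced", "inexpensive", ...).
        if re.search(re.escape(value), text, re.IGNORECASE):
            realised.append(slot)
        else:
            omitted.append(slot)  # omission: MR info missing from output
    # Hallucination checking would additionally scan the output for
    # slot values that are *not* in the MR; it is omitted here.
    return realised, omitted

mr = {"name": "The Eagle", "eatType": "coffee shop", "area": "riverside"}
print(check_slots(mr, "The Eagle is a coffee shop."))
# (['name', 'eatType'], ['area'])  -> 'area' is omitted
```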

Cited by 80 publications (105 citation statements) | References 28 publications

“…The former path is risky as it easily results in ungrammatical targets. The latter approach of enforcing a stronger alignment between inputs and outputs has been tried previously, but it assumes a moderate amount of noise in the data (Nie et al., 2019; Dušek et al., 2019). Alternatively, one can leave the data as is and try to put more pressure on the decoder to pay attention to the input at every generation step (Tian et al., 2019).…”
Section: Introduction
confidence: 99%
“…But this is not a straightforward task. Specially designed regular expressions (Dušek et al., 2019a) or heuristics involving dependency relations (Oraby et al., 2019) must be used. Augmented Input Sequence: Once the surface forms of each attribute-value pair in a target utterance are found, we add them to the input sequence, as shown in Figure 2.…”
Section: Methods
confidence: 99%
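
The "augmented input sequence" quoted above pairs each attribute-value token with the tokens of its surface form from the target utterance. The following sketch is a hypothetical reconstruction of that format; the `<attr=value>` token notation and the function below are illustrative assumptions, not the cited paper's exact scheme (cf. its Figure 2).

```python
# Hypothetical reconstruction of the augmented input sequence: each
# attribute-value pair becomes one special token, immediately followed
# by the tokens of its surface form found in the target utterance.

def augment_input(mr_pairs, surface_forms):
    """mr_pairs:      list of (attribute, value) tuples from the MR
    surface_forms: dict (attribute, value) -> extracted surface string"""
    tokens = []
    for attr, val in mr_pairs:
        tokens.append(f"<{attr}={val}>")          # one token per pair
        sf = surface_forms.get((attr, val), "")   # may be absent/noisy
        tokens.extend(sf.split())                 # its surface-form tokens
    return tokens

pairs = [("name", "The Eagle"), ("eatType", "coffee shop")]
sfs = {("name", "The Eagle"): "The Eagle",
       ("eatType", "coffee shop"): "coffee shop"}
print(augment_input(pairs, sfs))
# ['<name=The Eagle>', 'The', 'Eagle', '<eatType=coffee shop>', 'coffee', 'shop']
```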
“…To extract the surface form of each attribute-value pair from a target utterance, we used modified regular expressions from Dušek et al. (2019a). The input sequence was constructed in the format of a single token representing an attribute-value pair followed by multiple tokens for the surface form, e.g.…”
Section: Applying the Surface Forms Methods
confidence: 99%
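
The excerpt above leaves its format example elided ("e.g.…"), so the sketch below shows only the general shape of regex-based surface-form extraction. The slot patterns are invented for illustration and are NOT the modified regexes of Dušek et al. (2019a).

```python
import re

# Invented per-slot patterns in the spirit of the modified regexes the
# excerpt mentions; not the actual patterns of Dušek et al. (2019a).
SLOT_PATTERNS = {
    ("familyFriendly", "yes"): re.compile(r"\b(family[- ]friendly|kid[- ]friendly)\b", re.I),
    ("priceRange", "cheap"):   re.compile(r"\b(cheap|low[- ]priced|inexpensive)\b", re.I),
}

def extract_surface_form(slot, value, utterance):
    """Return the surface form of (slot, value) in the utterance, or None."""
    pattern = SLOT_PATTERNS.get((slot, value))
    if pattern is None:
        # Fallback: match the literal slot value.
        pattern = re.compile(re.escape(value), re.I)
    m = pattern.search(utterance)
    return m.group(0) if m else None

print(extract_surface_form("priceRange", "cheap",
                           "A low-priced, family friendly venue."))
# 'low-priced'
```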
“…In particular, this can be very likely if “the girl wants” appears much more frequently than “the boy wants” in the training corpus. This is a very important issue because of its wide existence across many neural graph-to-text generation models, hindering the usability of these models for real-world applications (Dušek et al., 2018, 2019; Balakrishnan et al., 2019).…”
Section: Introduction
confidence: 99%