Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/d16-1140
Controlling Output Length in Neural Encoder-Decoders

Abstract: Neural encoder-decoder models have shown great success in many sequence generation tasks. However, previous work has not investigated situations in which we would like to control the length of encoder-decoder outputs. This capability is crucial for applications such as text summarization, in which we have to generate concise summaries with a desired length. In this paper, we propose methods for controlling the output sequence length for neural encoder-decoder models: two decoding-based methods and two learning-based methods.
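To make the decoding-based idea concrete, here is a minimal greedy-decoding sketch that blocks the end-of-sequence token until a desired length is reached and then forces it. The `decoder_step` interface and the hard EOS constraint are illustrative assumptions, not the authors' exact decoding procedures.

```python
import torch

def greedy_decode_with_length(decoder_step, init_state, bos_id, eos_id, desired_len):
    """Greedy decoding constrained to emit exactly `desired_len` tokens.

    `decoder_step(token_id, state) -> (logits, new_state)` is an assumed interface;
    this only illustrates a decoding-based length constraint, not the paper's method.
    """
    state, token = init_state, bos_id
    output = []
    for t in range(desired_len):
        logits, state = decoder_step(token, state)
        if t < desired_len - 1:
            logits[eos_id] = float("-inf")   # forbid EOS before the desired length
            token = int(torch.argmax(logits))
        else:
            token = eos_id                    # force EOS once the desired length is reached
        output.append(token)
    return output
```

A beam-search variant would apply the same EOS mask to every hypothesis at each step.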

Cited by 187 publications (175 citation statements) · References 28 publications
“…With lstm+leninit, 2 out of 100 randomly chosen sentences ended with a word that cannot appear at the end of a sentence. In contrast, with lstm+lenemb, 24 sentences ended with such words and are therefore ungrammatical, although lenemb has been shown to be effective in abstractive sentence summarization (Kikuchi et al., 2016). This result suggests that lstm+lenemb is excessively affected by the desired length because lenemb receives the potential desired length at each decoding step.…”
Section: Human Evaluation (mentioning)
confidence: 95%
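For readers unfamiliar with lenemb, the mechanism described above (feeding the remaining desired length to the decoder at every step) can be sketched roughly as follows; the layer sizes, the length-bucket vocabulary, and the LSTMCell interface are assumptions for illustration, not the configuration used by Kikuchi et al. (2016).

```python
import torch
import torch.nn as nn

class LenEmbDecoderCell(nn.Module):
    """Sketch of the lenemb idea: an embedding of the remaining output length is fed
    into the decoder LSTM at every step, alongside the previous word embedding.
    Layer sizes and the length-bucket vocabulary are illustrative assumptions."""

    def __init__(self, vocab_size, emb_dim=256, max_len=300, len_dim=32, hidden=512):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, emb_dim)
        self.len_emb = nn.Embedding(max_len, len_dim)   # one embedding per remaining-length value
        self.cell = nn.LSTMCell(emb_dim + len_dim, hidden)

    def forward(self, prev_word, remaining_len, state):
        # prev_word, remaining_len: LongTensors of shape (batch,)
        x = torch.cat([self.word_emb(prev_word), self.len_emb(remaining_len)], dim=-1)
        h, c = self.cell(x, state)
        return h, (h, c)
```

At decoding step t with desired length L, remaining_len would typically be max(L - t, 0), so the decoder always sees how much length budget is left.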
“…As the second and third models, we extend the first model to control the output length (Kikuchi et al., 2016). The second model, lstm+leninit, initializes the memory cell of the decoder as m_0 = tarlen * b_len, where tarlen is the desired output length and b_len is a trainable parameter.…”
Section: Sentence Compression with LSTM (mentioning)
confidence: 99%
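A minimal sketch of the leninit initialization quoted above, assuming a PyTorch-style decoder whose initial hidden state is zero and whose initial memory cell is set to tarlen * b_len; the hidden size and batch handling are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LenInitState(nn.Module):
    """Sketch of the leninit initialization: the decoder's initial memory cell is
    m_0 = tarlen * b_len with a trainable vector b_len; the hidden state starts at zero.
    The hidden size and the zero h_0 are illustrative assumptions."""

    def __init__(self, hidden=512):
        super().__init__()
        self.b_len = nn.Parameter(torch.randn(hidden) * 0.01)  # trainable length-scaling vector

    def forward(self, tarlen):
        # tarlen: FloatTensor of shape (batch,) holding the desired output lengths
        m0 = tarlen.unsqueeze(1) * self.b_len                   # m_0 = tarlen * b_len
        h0 = torch.zeros_like(m0)
        return h0, m0
```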
“…Each control variable corresponds to an attribute of a sentence. Compared to previous work that only seeks to control the values of sentiment (Hu et al., 2017) and length (Kikuchi et al., 2016), we further explore neural text generation with particular verbal predicates, semantic frames, and automatically-induced clusters.…”
Section: Introduction (mentioning)
confidence: 99%
“…Thus, we expect to decisively prohibit excessive generation. Finally, we evaluate the effectiveness of our method on the well-studied ABS benchmark data provided by Rush et al. (2015) and evaluated in (Chopra et al., 2016; Nallapati et al., 2016b; Kikuchi et al., 2016; Takase et al., 2016; Ayana et al., 2016; Gulcehre et al., 2016).…”
Section: Introduction (mentioning)
confidence: 99%