2019
DOI: 10.1109/tnnls.2019.2929141

Deep Reinforcement Learning for Sequence-to-Sequence Models

Abstract: In recent times, sequence-to-sequence (seq2seq) models have gained a lot of popularity and provide state-of-the-art performance in a wide variety of tasks such as machine translation, headline generation, text summarization, speech-to-text conversion, and image caption generation. The underlying framework for all these models is usually a deep neural network comprising an encoder and a decoder. Although simple encoder-decoder models produce competitive results, many researchers have proposed additional improvements…
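As a reading aid, here is a minimal sketch of the encoder-decoder pattern the abstract describes, in PyTorch. The layer sizes, the GRU cells, and the teacher-forced decoding are illustrative assumptions, not details of any model surveyed in the paper.

```python
# Minimal encoder-decoder seq2seq sketch. All sizes and the GRU choice are
# illustrative assumptions, not details from the surveyed paper.
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    def __init__(self, vocab_size=10000, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.GRU(emb_dim, hid_dim, batch_first=True)
        self.decoder = nn.GRU(emb_dim, hid_dim, batch_first=True)
        self.out = nn.Linear(hid_dim, vocab_size)

    def forward(self, src, tgt):
        # Encode the source sequence into a final hidden state.
        _, h = self.encoder(self.embed(src))
        # Condition the decoder on that state (teacher forcing with tgt).
        dec_out, _ = self.decoder(self.embed(tgt), h)
        return self.out(dec_out)  # per-step vocabulary logits

model = Seq2Seq()
src = torch.randint(0, 10000, (2, 12))  # batch of 2 source sequences
tgt = torch.randint(0, 10000, (2, 9))   # shifted target sequences
logits = model(src, tgt)                # shape (2, 9, vocab_size)
```

Attention, pointer mechanisms, and the reinforcement learning objectives the survey covers all build on this same encoder-decoder skeleton.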

Cited by 136 publications (93 citation statements)
References 130 publications
“…Shortcomings of maximum-likelihood training for sequence generation have often been discussed (Ding and Soricut, 2017; Leblond et al., 2018; Ranzato et al., 2016), but without pointing to generalization as the key aspect. An overview of recent deep reinforcement learning methods for conditional generation can be found in Keneshloo et al. (2018). Our proposed approach follows work by Ding et al. (2017) and Tan et al. (2018) by employing both policy and reward for exploration.…”
Section: Related Work
confidence: 99%
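To make the contrast with maximum-likelihood training concrete, here is a generic REINFORCE-style sketch of reward-driven sequence training in PyTorch. The sequence-level reward and the absence of a baseline are simplifying assumptions; this is not the exact procedure of Ding et al. (2017) or Tan et al. (2018).

```python
# Generic REINFORCE sketch for sequence generation: the policy samples a
# sequence, a reward (e.g. ROUGE/BLEU) scores it, and the gradient raises the
# probability of high-reward samples. Illustrative, not the cited methods.
import torch

def policy_gradient_loss(logits, sampled_ids, rewards):
    """logits: (B, T, V) policy outputs; sampled_ids: (B, T) tokens sampled
    from the policy; rewards: (B,) sequence-level scores."""
    log_probs = torch.log_softmax(logits, dim=-1)
    # Log-probability of each sampled token under the current policy.
    tok_logp = log_probs.gather(-1, sampled_ids.unsqueeze(-1)).squeeze(-1)
    seq_logp = tok_logp.sum(dim=1)          # (B,) total sequence log-prob
    # REINFORCE: weight each sequence's log-prob by its reward.
    return -(rewards * seq_logp).mean()

logits = torch.randn(2, 5, 100, requires_grad=True)
sampled = torch.randint(0, 100, (2, 5))
loss = policy_gradient_loss(logits, sampled, torch.tensor([0.7, 0.2]))
loss.backward()
```

In practice a baseline (for example, a critic's value estimate or the reward of a greedy rollout) is usually subtracted from the rewards to reduce gradient variance.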
“…For the Pointer-Generator Network from See et al. (2017), we follow their implementation⁴ and use a batch size of 16. For Paulus et al. (2018), we use an implementation from Keneshloo et al. (2018)⁵. We did not include the intra-temporal attention and the intra-decoder attention because they hurt performance.…”
Section: Implementation Details
confidence: 99%
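The excerpt amounts to a small set of training choices; a hedged sketch of how such a configuration might be recorded is below. The field names are hypothetical; only the batch size and the two disabled attention mechanisms come from the excerpt.

```python
# Hypothetical configuration record for the setup described in the excerpt.
# Field names are illustrative, not flags from the cited implementations.
from dataclasses import dataclass

@dataclass
class SummarizerConfig:
    batch_size: int = 16                     # value reported in the excerpt
    intra_temporal_attention: bool = False   # disabled: hurt performance
    intra_decoder_attention: bool = False    # disabled: hurt performance

cfg = SummarizerConfig()
```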
“…Other information retrieval based algorithms [10], [55]
Single-encoder based algorithms [31], [33], [41], [89]
Multiple-encoder based algorithms [32], [80]
Other algorithms [64], [68], [81]
Note that in the process of collecting papers, we first performed two types of searches for related papers: (1) online library search for papers containing keywords including "code + comment", "comment", "code + summary" and "summary" in the fields of title, abstract and index terms of the papers from the ACM Digital Library, IEEE Xplore Digital Library, DBLP, Google Scholar and arXiv.org; (2) specific search of major conference proceedings and journals in software engineering and artificial intelligence, including IEEE ICSE, IEEE FSE, IEEE/ACM ASE, IEEE TSE, ACM TOSEM, EMSE, AAAI and IJCAI.…”
Section: Trends of the Development of Code Commenting Techniques
confidence: 99%
“…That is, they use two encoders in the classical encoder-decoder framework. Under this framework, they exploit a reinforcement learning model to solve two issues: exposure bias and the inconsistency between training and test measurements [81]. They leverage an actor network and a critic network to jointly determine the next best word at each time step.…”
Section: B) Multiple-Encoder Based Comment Generation Algorithms
confidence: 99%
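A hedged sketch of the actor-critic step the excerpt describes: the actor maps a decoder state to a distribution over next words, while the critic scores the state so an advantage can weight the policy update. The module shapes, the sampling step, and the stubbed advantage values are illustrative assumptions, not the architecture of [81].

```python
# One actor-critic decoding step: actor proposes the next-word distribution,
# critic estimates the state's value. Illustrative sketch, not [81]'s model.
import torch
import torch.nn as nn

class Actor(nn.Module):
    def __init__(self, hid_dim=256, vocab_size=10000):
        super().__init__()
        self.proj = nn.Linear(hid_dim, vocab_size)

    def forward(self, state):                # state: (B, hid_dim)
        return torch.softmax(self.proj(state), dim=-1)

class Critic(nn.Module):
    def __init__(self, hid_dim=256):
        super().__init__()
        self.value = nn.Linear(hid_dim, 1)

    def forward(self, state):                # estimated return of the state
        return self.value(state).squeeze(-1)

actor, critic = Actor(), Critic()
state = torch.randn(4, 256)                  # hypothetical decoder states
probs = actor(state)                         # (4, vocab) next-word policy
next_word = torch.multinomial(probs, 1)      # sample the next word
# Advantage = reward - critic value; stubbed here for illustration.
advantage = torch.tensor([0.2, -0.1, 0.3, 0.0])
logp = torch.log(probs.gather(1, next_word)).squeeze(1)
actor_loss = -(advantage * logp).mean()      # policy-gradient update signal
```

The critic is trained separately to regress the observed sequence reward, so that the advantage term tells the actor whether a sampled word did better or worse than expected.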