2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE) 2019
DOI: 10.1109/icse.2019.00087

A Neural Model for Generating Natural Language Summaries of Program Subroutines

Abstract: Source code summarization (creating natural language descriptions of source code behavior) is a rapidly growing research topic with applications to automatic documentation generation, program comprehension, and software maintenance. Traditional techniques relied on heuristics and templates built manually by human experts. Recently, data-driven approaches based on neural machine translation have largely overtaken template-based systems. But nearly all of these techniques rely almost entirely on programs having …

Cited by 251 publications (299 citation statements)
References 51 publications
“…Problem Statement The problem we target in this paper is the extraction of summary descriptions from unstructured subroutine comments. By "summary descriptions," we mean a short natural language explanation of code behavior or purpose (maximum 12 words, in line with related work [10], which found that most summary descriptions consisted of fewer than 13 word tokens). By "unstructured subroutine comments," we mean the long comments immediately preceding methods in source code.…”
Section: Introduction
confidence: 82%
“…Referring to the process as "summarization" alludes to a history of work in Natural Language Processing of extractive summarization of documents - early attempts at code summarization involved choosing a set of n important words from code [18], [19] and then converting those words into complete sentences by placing them into sentence templates [2], [20]-[22]. A 2016 survey [23] highlights these approaches around the time that a vast majority of code summarization techniques began to be based on neural networks trained from big data input [10], [14], [24]-[27]. These NN-based approaches have proliferated, but suffer an Achilles' heel of reliance on very large, clean datasets of examples of code comments.…”
Section: A Source Code Summarization
confidence: 99%
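
The citation statement above describes early code summarization as choosing a set of n important words from code and placing them into sentence templates. A minimal Python sketch of that general idea follows. It is an illustration only, not the method of any cited paper: ranking words by raw frequency, the `STOP_WORDS` list, the function names, and the template sentence are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical stop list (assumption); a real system would use a tuned one.
STOP_WORDS = {"int", "return", "for", "if", "else", "void", "public", "new"}


def split_identifier(token):
    """Split a camelCase or snake_case identifier into lowercase words."""
    parts = re.findall(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])|\d+", token)
    return [p.lower() for p in parts]


def top_words(code, n):
    """Return the n most frequent words in the code's identifiers.

    Ties are broken by first appearance so the result is deterministic.
    Frequency is a stand-in for "importance" (assumption).
    """
    tokens = re.findall(r"[A-Za-z_][A-Za-z0-9_]*", code)
    words = [
        w
        for t in tokens
        for w in split_identifier(t)
        if len(w) > 1 and w not in STOP_WORDS
    ]
    first_seen = {}
    for i, w in enumerate(words):
        first_seen.setdefault(w, i)
    counts = Counter(words)
    ranked = sorted(counts, key=lambda w: (-counts[w], first_seen[w]))
    return ranked[:n]


def summarize(code, n=3):
    """Fill a fixed sentence template (assumption) with the selected words."""
    return "This subroutine involves " + ", ".join(top_words(code, n)) + "."


if __name__ == "__main__":
    example = (
        "int sumPrices(List prices) { int total = 0; "
        "for (Price p : prices) total += p.getPrice(); return total; }"
    )
    print(summarize(example))
```

A fixed template keeps the output grammatical but cannot express what the subroutine actually does with the selected words, which is one reason the data-driven approaches mentioned in the statement replaced this style of system.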