Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering 2017
DOI: 10.1145/3106237.3121274
|View full text |Cite
|
Sign up to set email alerts
|

Suggesting meaningful variable names for decompiled code: a machine translation approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
23
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 13 publications
(23 citation statements)
references
References 14 publications
0
23
0
Order By: Relevance
“…In t r o d u c t io n Deep Learning (DL) has been used to support a vast variety of code-related tasks. Some examples include automatic bug fixing [1]- [4], learning generic code changes [5], code migration [6 ], [7], code summarization [8]- [11], pseudo-code generation [12], code deobfuscation [13], [14], injection of code mutants [15], automatic generation of assert statements [16], and code completion [17]- [21]. These works customize DL models proposed in the Natural Language Processing (NLP) field to support the previously listed tasks.…”
mentioning
confidence: 99%
“…In t r o d u c t io n Deep Learning (DL) has been used to support a vast variety of code-related tasks. Some examples include automatic bug fixing [1]- [4], learning generic code changes [5], code migration [6 ], [7], code summarization [8]- [11], pseudo-code generation [12], code deobfuscation [13], [14], injection of code mutants [15], automatic generation of assert statements [16], and code completion [17]- [21]. These works customize DL models proposed in the Natural Language Processing (NLP) field to support the previously listed tasks.…”
mentioning
confidence: 99%
“…This is achieved by using two encoders-a lexical encoder (Section III-B1) and a structural encoder (Section III-B2)-to separately capture the lexical and structural signals in the decompiled code. As we will show, this combination of lexical and structural information allows DIRE to outperform techniques that rely on lexical information alone [21].…”
Section: A Overviewmentioning
confidence: 86%
“…The remaining step, mapping developer-chosen names to variable IDs, is the core challenge in automatic corpus generation. Following our previous approach [21], we leverage the decompiler's ability to incorporate developer-chosen identifier names into decompiled code when DWARF debugging symbols [26] are present in the binary. However, this alone is not sufficient to identify which developer-chosen name maps to a particular variable ID generated in the first step.…”
Section: Generation Of Training Datamentioning
confidence: 99%
See 2 more Smart Citations