2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE) 2019
DOI: 10.1109/ase.2019.00064
|View full text |Cite
|
Sign up to set email alerts
|

DIRE: A Neural Approach to Decompiled Identifier Naming

Abstract: The decompiler is one of the most common tools for examining binaries without corresponding source code. It transforms binaries into high-level code, reversing the compilation process. Decompilers can reconstruct much of the information that is lost during the compilation process (e.g., structure and type information). Unfortunately, they do not reconstruct semantically meaningful variable names, which are known to increase code understandability. We propose the Decompiled Identifier Renaming Engine (DIRE), a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
58
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 58 publications
(62 citation statements)
references
References 33 publications
0
58
0
Order By: Relevance
“…Code completion [32,34,36,52] is one of the most widely explored topics. Machine learning models of code are also used to predict names of variables and functions [2,3,8,9,12], with applications to deobfuscation [51,59] and decompilation [22,30,41]. Significant effort has been made towards automatically generating documentation from code or vice versa [8,11,24,35].…”
Section: Related Workmentioning
confidence: 99%
“…Code completion [32,34,36,52] is one of the most widely explored topics. Machine learning models of code are also used to predict names of variables and functions [2,3,8,9,12], with applications to deobfuscation [51,59] and decompilation [22,30,41]. Significant effort has been made towards automatically generating documentation from code or vice versa [8,11,24,35].…”
Section: Related Workmentioning
confidence: 99%
“…A binary function can have structural and textual properties. Being inspired by word2vec, different techniques have been proposed by Ding et al [17] and others [18], [19], [20], [21], [22], [23], [24] to model the structural and textual aspects of a binary function as an embedding vector or to compute the similarity using deep neural networks.…”
Section: Deep Learningmentioning
confidence: 99%
“…So these conventional modeling methods need to be adjusted. DIRE [21] used both lexical information obtained from the tokenized code as well as structural information obtained from the corresponding ASTs to recover variable names. David et al [22] combined static analysis with encoder-decoder-based models to predict procedure names in stripped binaries.…”
Section: Related Workmentioning
confidence: 99%