Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis 2020
DOI: 10.1145/3395363.3397362
|View full text |Cite
|
Sign up to set email alerts
|

Functional code clone detection with syntax and semantics fusion learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
47
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 90 publications
(47 citation statements)
references
References 33 publications
0
47
0
Order By: Relevance
“…2) Code Clone Detection: There are supervised and unsupervised approaches to detect clones. While deep learning methods are applied to detect code clones, they require labelled data to train a supervised learning model [14,44,55]. As such, one needs human annotators to mark the pairs of snippets as clones, limiting the ability to detect clones by large amount of the data one can collect.…”
Section: U S E Ca S E Smentioning
confidence: 99%
See 2 more Smart Citations
“…2) Code Clone Detection: There are supervised and unsupervised approaches to detect clones. While deep learning methods are applied to detect code clones, they require labelled data to train a supervised learning model [14,44,55]. As such, one needs human annotators to mark the pairs of snippets as clones, limiting the ability to detect clones by large amount of the data one can collect.…”
Section: U S E Ca S E Smentioning
confidence: 99%
“…Note that we do not compare with techniques such as Oreo [55], CCD [14], ASTNN [44] because they use supervised learning techniques to build clone classifiers. We believe that the code embeddings or the weights from the pretrained InferCode can be used for training supervised clone classifiers too, and with further improvement on self-supervised learning techniques such as improving the encoder, the auto-identified labels, and the loss function, the performance of unsupervised code clone detection may also get close to supervised ones.…”
Section: B Code Clone Detection 1) Datasets Metrics and Baselinesmentioning
confidence: 99%
See 1 more Smart Citation
“…An abstract syntax tree (AST) is a tree representation of the abstract syntactic structure of source code written in a programming language [11]. The abstract syntax tree clearly describes the structure of the source code.…”
Section: B Abstract Syntax Treementioning
confidence: 99%
“…In many existing studies, source code is parsed into abstract syntax trees to produce code representations that capture the semantic relationships between different code elements [12], [13]. Code representation based on abstract syntax trees is now being used for code clone detection [11], defect prediction [14], auto program repair [15], and other problems. In metric-based code smell detection methods, abstract syntax trees may also be used to compute a set of source code metrics [7], [16].…”
Section: B Abstract Syntax Treementioning
confidence: 99%