2021
DOI: 10.1109/access.2021.3079156
|View full text |Cite
|
Sign up to set email alerts
|

An Effective Semantic Code Clone Detection Framework Using Pairwise Feature Fusion

Abstract: Code clone detection is important for effective software maintenance. The task is more challenging when clones are semantically similar (Type-IV) in nature, having no structural resemblance to each other. Most existing methods use sequence similarity and/or graph isomorphism between either Abstract Syntax Trees (AST) or Program Dependency Graphs (PDG) to detect Type-I, II and III clones. However, they are mostly unsuccessful in detecting semantic or Type-IV clones. In this work, we propose a novel detection fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 64 publications
0
7
0
Order By: Relevance
“…It achieves higher recall and F1 scores, with values of (96.3%, 89.7%) and (96.4%, 92.3%) respectively. Furthermore, the proposed technique, utilizing the LightBGM classifier outperformed CDLH, ASTNN, and Sheneamer et al [78] in detecting VST3, ST3, and MT3 clones in terms of F1-score.…”
Section: Plos Onementioning
confidence: 89%
See 2 more Smart Citations
“…It achieves higher recall and F1 scores, with values of (96.3%, 89.7%) and (96.4%, 92.3%) respectively. Furthermore, the proposed technique, utilizing the LightBGM classifier outperformed CDLH, ASTNN, and Sheneamer et al [78] in detecting VST3, ST3, and MT3 clones in terms of F1-score.…”
Section: Plos Onementioning
confidence: 89%
“…ASTNN [77] employs an AST-based neural network that represents source code with smaller statement trees derived from segmented ASTs, capturing both lexical and syntactical knowledge. Finally, Sheneamer et al [78] employ multiple machine learning classifiers to detect code clones, using features extracted from ASTs and PDGs. DLC [76] uses a recursive neural network approach to detect code similarity, utilizing the Euclidean distance as a metric.…”
Section: Plos Onementioning
confidence: 99%
See 1 more Smart Citation
“…An efficient semantic CCD (ES-CCD) technique was developed [13] by employing a pair-wise feature fusion for automated identification of all four clone types. AST and program dependency graphs (PDGs) were efficiently utilized to prepare labelled training features for detecting the Java code clones, including semantic clones.…”
Section: Literature Surveymentioning
confidence: 99%
“…Appache opennlp-master 1.9.1 is a natural language information retrieval toolkit based on machine learning. Among the most common NLP operations supported are indexing, sentence classification, part-of-speech labelling, named entity identification, stacking, and processing [25] Using these datasets, the proposed method's efficiency is compared to that of existing methods such as LV-CCD [9], ES-CCD [13], TBCNN-CCD [14], and CPVDetector [15] terms of accuracy, precision, recall, time period, memory space, and clone types for detecting all four types of CC detection and its similarity features.…”
Section: Performance Evaluationmentioning
confidence: 99%