Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of 2019
DOI: 10.1145/3338906.3341182
|View full text |Cite
|
Sign up to set email alerts
|

CloneCognition: machine learning based code clone validation tool

Abstract: A code clone is a pair of code fragments, within or between software systems that are similar. Since code clones often negatively impact the maintainability of a software system, several code clone detection techniques and tools have been proposed and studied over the last decade. However, the clone detection tools are not always perfect and their clone detection reports often contain a number of false positives or irrelevant clones from specific project management or user perspective. To detect all possible s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 38 publications
0
5
0
Order By: Relevance
“…Since many clone detectors return code fragments that are not considered clones by users, it is necessary to perform manual validation of the reported possible clones. Mostaeen et al [176] present a tool called CloneCognition that can automate the laborious manual validation process by using Artiicial Neural Network (ANN). Their tool showed promising performance when compared with state-of-the-art clone validation techniques.…”
Section: Performance Predictionmentioning
confidence: 99%
“…Since many clone detectors return code fragments that are not considered clones by users, it is necessary to perform manual validation of the reported possible clones. Mostaeen et al [176] present a tool called CloneCognition that can automate the laborious manual validation process by using Artiicial Neural Network (ANN). Their tool showed promising performance when compared with state-of-the-art clone validation techniques.…”
Section: Performance Predictionmentioning
confidence: 99%
“…Bandara and Wijayarathna [38] identified features such as the number of characters and words, identifier count, identifier character count, and underscore count using antlr tool. Some studies [221,223,224] utilized line similarity and token similarity. Yang et al [349] computed tf-idf along with other metrics such as position of clones in the file.…”
Section: Code Clone Detectionmentioning
confidence: 99%
“…Similarly, Sheneamer and Kalita [288] compared the performance of Support Vector Machine, Linear Discriminant Analysis, Instance-Based Learner, Lazy K-means, Decision Tree, Naive Bayes, Multilayer Perceptron, and Logit Boost. dl-based models: dl models such as ann [223,224], dnn [99,365], and rnn with Reverse neural network [342] are also employed extensively. Bui et al [58] and Bui et al [57] combined neural networks for ml models training.…”
Section: Model Trainingmentioning
confidence: 99%
“…On the other hand, creating clones in source code may introduce new defects or bugs, which can lead to extra maintenance effort to ensure consistency among cloned code snippets. Since software clones have a clear influence on software maintenance and evolution, many studies [85,90,118,139,145,146] proposed various approaches to detect and management code clones in source code.…”
Section: Configuration Managementmentioning
confidence: 99%
“…They also applied Random Forest classifier to predict the life expectancy of newly-introduced clones. Mostaeen et al [85] present a tool called CloneCognition that can automate the laborious manual validation process by using Artificial Neural Network (ANN). Their tool showed promising performance when compared with state-of-the-art clone validation techniques.…”
Section: Configuration Managementmentioning
confidence: 99%