Proceedings of the 4th International Workshop on Software Clones 2010
DOI: 10.1145/1808901.1808919
|View full text |Cite
|
Sign up to set email alerts
|

Towards a multi-scale approach for source code approximate match report

Abstract: International audienceFinding exact clones in source code can be e ciently handled using classical exact substring or subtree pattern match- ing techniques inspired from genomics applications. These methods may be wisely employed as a foundation to sketch new techniques highlighting duplicated code chunks present- ing minor edits or more extensive modi cations at a higher structural scale. The main goal is to improve recall of small near matches and to aggregate them into larger ones to pro- vide a more global… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2010
2010
2013
2013

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 12 publications
0
3
0
Order By: Relevance
“…Abstract syntax tree representations could allow more sophisticate patterns of pre-processing of the representation for better abstraction and normalization of the code, a topic that has been neglected in this article. We are investigating some new techniques in this way [21,32,23] that could also consider the function call graphs of the projects. for the computed similarity metrics between the original project and the obfuscated versions.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Abstract syntax tree representations could allow more sophisticate patterns of pre-processing of the representation for better abstraction and normalization of the code, a topic that has been neglected in this article. We are investigating some new techniques in this way [21,32,23] that could also consider the function call graphs of the projects. for the computed similarity metrics between the original project and the obfuscated versions.…”
Section: Resultsmentioning
confidence: 99%
“…If abstract syntax trees [30,23] are considered, some type 3 to type 4 clones can even be managed (tiny expression rewriting or transposition of independent code). Except for suffix indexation techniques other approaches rely on the meta-tokenization of a string of tokens or a brotherhood of sub-trees.…”
Section: Studying the Factorized Graph Nodes And Its Inferred Metricsmentioning
confidence: 99%
See 1 more Smart Citation