Code Clone Analysis 2021
DOI: 10.1007/978-981-16-1927-4_4

SourcererCC: Scalable and Accurate Clone Detection

Cited by 24 publications (61 citation statements)
References 21 publications
“…Moreover, since the amount of code that has to be processed is in a scale of millions line of code (as shown in Table 1), a clone detector must be scalable enough to report clones in a reasonable amount of time. We have tried 7 state-of-the-art clone detectors including Simian [66], SourcererCC [64], NiCad [13], [62], CCFinder [29], iClones [25], DECKARD [28], and PMD-CPD [51] against the Stack Overflow and Qualitas datasets. NiCad failed to parse 44,960 Stack Overflow snippets while PMD CPD failed to complete the execution due to lexical errors.…”
Section: Phase 1: Clone Identification (mentioning)
confidence: 99%
“…Manual clone detection is a tedious and error-prone task, and known to be infeasible for larger systems [62]. Tool-based clone detection requires manual verification and postprocessing, since even the most advanced clone detection tools have imperfect precision and recall [63].…”
Section: RQ4 (mentioning)
confidence: 99%
“…BigCloneBench has been primarily developed to measure and compare the recall of clone detection tools [70], [78], [79], [80]. However, it can also be used for other clone and software studies [80].…”
Section: BigCloneBench for Code Clones (mentioning)
confidence: 99%