2005
DOI: 10.1093/comjnl/bxh119
|View full text |Cite
|
Sign up to set email alerts
|

PDetect: A Clustering Approach for Detecting Plagiarism in Source Code Datasets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0
6

Year Published

2013
2013
2019
2019

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 55 publications
(41 citation statements)
references
References 11 publications
0
33
0
6
Order By: Relevance
“…A clustering based approach, P-detect is used for detecting plagiarism in source datasets, developed by Lefteris Moussiades and Athena Vakali [18].This P-detect firstly take the set of programs as input and represents the programs as a set of keywords. A similarity measure evaluation by Jaccard"s similarity coefficient is performed for each pair of programs.…”
Section: International Journal Of Computer Applications (0975 -8887) mentioning
confidence: 99%
“…A clustering based approach, P-detect is used for detecting plagiarism in source datasets, developed by Lefteris Moussiades and Athena Vakali [18].This P-detect firstly take the set of programs as input and represents the programs as a set of keywords. A similarity measure evaluation by Jaccard"s similarity coefficient is performed for each pair of programs.…”
Section: International Journal Of Computer Applications (0975 -8887) mentioning
confidence: 99%
“…However, the winnowing algorithm requires calculating set inclusion, which is expensive when comparing many features. Additional plagiarism detection techniques are explored in [43][44][45], but they use complimentary techniques for plagiarism detection. Namely, source code clustering and manual analysis, program dependency graphs, and measuring approximate Kolmogorov complexity between programs, respectively.…”
Section: Related Workmentioning
confidence: 99%
“…A ideia desse método é baseada na proposta de representação e classificação de perfis de e [Oliveira et al 2014] e na estratégia de clusterização de plágios em exercícios de programação de [Moussiades and Vakali 2005].…”
Section: Fundamentação Teóricaunclassified
“…O PDetect é um sistema que faz análise e clusterização de plágios em exercícios de programação [Moussiades and Vakali 2005]. Para isso, o PDetect recebe códigos-fontes, mapeia-os em um conjunto de tokens de uma linguagem de programação, identifica os pares suspeitos de plágios e os reúne em clusters, formados a partir de um parâmetro de corte, que é o Coeficiente de Jaccard [Moussiades and Vakali 2005].…”
Section: Trabalhos Relacionadosunclassified
See 1 more Smart Citation