2017
DOI: 10.1016/j.infsof.2017.07.003
|View full text |Cite
|
Sign up to set email alerts
|

Automated change-prone class prediction on unlabeled dataset using unsupervised method

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
14
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 21 publications
(17 citation statements)
references
References 24 publications
0
14
0
Order By: Relevance
“…To achieve this goal, we have constructed a defect prediction model by exploiting the unlabelled software datasets of Geant4 that is one of the most rigorously validated software packages for the simulation of the passage of particles through matter [18]. Amongst the different ML methodologies, we have selected CLAMI [13] and CLAMI+ [14] in order to label the instances in the software datasets. In addition, we have applied a large set of ML techniques to predict defect-prone modules.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…To achieve this goal, we have constructed a defect prediction model by exploiting the unlabelled software datasets of Geant4 that is one of the most rigorously validated software packages for the simulation of the passage of particles through matter [18]. Amongst the different ML methodologies, we have selected CLAMI [13] and CLAMI+ [14] in order to label the instances in the software datasets. In addition, we have applied a large set of ML techniques to predict defect-prone modules.…”
Section: Methodsmentioning
confidence: 99%
“…More in detail, CLAMI+ transforms the Boolean representation in CLAMI of metrics' violation into a probabilistic value based on the difference between the metric value and the threshold. Consequently, CLAMI+ considers how much an instance violated on a metric and leads to a different selection of the final training set that is expected to be more informative than that built by CLAMI [14].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…A supervised technique uses an already labelled dataset to train a classification algorithm. In an unsupervised approach, a dataset is labelled using certain heuristics such as distance measures to cluster related texts (Yan et al, 2017).…”
Section: Related Workmentioning
confidence: 99%
“…The popularity of DL models in SE is mainly due to the advantages of representation learning from raw data [79,95,134]. For example, in many recent SE studies, a large number of challenges derive from the semantic comprehension of code in programming languages [84,85,148,149,154], text in natural languages [34,150], or their mutual transformation [44]. As code and text involves some form of natural language processing (NLP), it commonly starts with encoding words by a fixed size of vocabulary [44].…”
Section: Background and Related Work 21 DL Technology In Sementioning
confidence: 99%