2019
DOI: 10.14710/jtsiskom.7.4.2019.121-126
|View full text |Cite
|
Sign up to set email alerts
|

Parameter tuning in KNN for software defect prediction: an empirical analysis

Abstract: Software Defect Prediction (SDP) provides insights that can help software teams to allocate their limited resources in developing software systems. It predicts likely defective modules and helps avoid pitfalls that are associated with such modules. However, these insights may be inaccurate and unreliable if parameters of SDP models are not taken into consideration. In this study, the effect of parameter tuning on the k nearest neighbor (k-NN) in SDP was investigated. More specifically, the impact of varying an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0
5

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 26 publications
(14 citation statements)
references
References 28 publications
0
9
0
5
Order By: Relevance
“…It involves the deployment of machine learning (ML) methods on software features or metrics derived from software systems repositories to predict the quality and reliability of a software system [1,2]. These software features are the quantifiable attributes of software system that can be analyzed to ascertain software systems quality and reliability [3,4]. Knowledge gained from SDP processes can be used by software engineers for improving software development processes and managing limited software resources.…”
Section: Introductionmentioning
confidence: 99%
“…It involves the deployment of machine learning (ML) methods on software features or metrics derived from software systems repositories to predict the quality and reliability of a software system [1,2]. These software features are the quantifiable attributes of software system that can be analyzed to ascertain software systems quality and reliability [3,4]. Knowledge gained from SDP processes can be used by software engineers for improving software development processes and managing limited software resources.…”
Section: Introductionmentioning
confidence: 99%
“…Kajian penggunaan algoritme kNN tersebut menunjukkan bahwa kemampuan kNN sangat tergantung pada nilai k yang digunakan. Beragam kajian mengkaji bagaimana menentukan nilai k terbaik, seperti dalam [4], [11]- [13]. Namun, penentuan nilai k masih trivial dan sangat tergantung pada dataset yang digunakan.…”
Section: Pendahuluanunclassified
“…Namun, bias yang mendekati 0 akan menimbullkan dugaan terjadi overfitting seperti dinyatakan [24]. Nilai k yang terbaik juga bervariasi, tidak terlalu dekat dengan akar dari jumlah sampel seperti disebutkan dalam [1], namun lebih bersifat trivial atau uji coba seperti dalam [11].…”
Section: A Pengaruh Variasi Nilai K Dan Lag Untuk Prediksi 1 Bulan Bunclassified
See 1 more Smart Citation
“…SDP models are built on details from software features such as source code complexity, software development history, software cohesion and coupling to predict defective modules in software systems. These software features are numerically quantified to determine the level of software systems quality and reliability [ 15 – 18 ].…”
Section: Introductionmentioning
confidence: 99%