2022
DOI: 10.1111/exsy.12977
|View full text |Cite
|
Sign up to set email alerts
|

Eliminating the high false‐positive rate in defect prediction through BayesNet with adjustable weight

Abstract: In defect prediction, a high false‐positive rate (FPR) caused by class imbalance not only increases the workload of testing and development but also consumes unnecessary costs. Many defect models against class imbalance have been proposed to improve the accuracy of defect prediction, but their ability to reduce FPR is unclear. To solve these problems, we first proposed a BayesNet with adjustable weights, called WBN, to reduce the FPR in software defect prediction, which is an algorithm independent of data prep… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 72 publications
(110 reference statements)
0
5
0
Order By: Relevance
“…Similar to classification techniques, these indicators also take into account the requirements of quantity and type, and these indicators have been widely used in the previous defect prediction research [86,87]. Nevertheless, more other types of indicators are not used in this paper [88][89][90], such as Gmeasure and FPR, etc. Recently effort-aware indicators have also been proposed to measure the performance of SDP models [91][92][93].…”
Section: Classification Techniquesmentioning
confidence: 99%
“…Similar to classification techniques, these indicators also take into account the requirements of quantity and type, and these indicators have been widely used in the previous defect prediction research [86,87]. Nevertheless, more other types of indicators are not used in this paper [88][89][90], such as Gmeasure and FPR, etc. Recently effort-aware indicators have also been proposed to measure the performance of SDP models [91][92][93].…”
Section: Classification Techniquesmentioning
confidence: 99%
“…The higher rate of false positives that the CART can generate has a detrimental effect on the amount of time and resources required to investigate false positives, which are non-buggy classes mistakenly labeled as buggy (Yanyang et al, 2022). Its poor F1score of 0.9114, which is exceeded by the RF (0.9169), provides additional evidence that this is the case.…”
Section: Cart (Classification and Regression Trees) Resultsmentioning
confidence: 99%
“…The samples of each category are randomly divided into 70% and 30%, samples are used to search the optimal hyperparameters of the SVM classifier, and the remaining samples are used to test the fault diagnosis accuracy of the PMDCMs. For each fault diagnosis, the accuracy of PMDCM’s model is checked by utilizing the combination of and ; the 10-fold cross-validation [ 43 ] is utilized while the aim is to obtain stable fault diagnosis accuracy. The process of 10-fold cross-validation can be seen in Figure 13 .…”
Section: 3 the Results Based On Svm Classifiermentioning
confidence: 99%