SoutheastCon 2018 2018
DOI: 10.1109/secon.2018.8478911
|View full text |Cite
|
Sign up to set email alerts
|

Applying Machine Learning to Predict Software Fault Proneness Using Change Metrics, Static Code Metrics, and a Combination of Them

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
21
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 19 publications
(23 citation statements)
references
References 5 publications
1
21
0
Order By: Relevance
“…Once obtained the labelled dataset, we have applied various ML techniques, previously selected among those that have already been used in the Software Engineering field, and, more in detail, in the software defect prediction problem, as presented in previous literature [25]: AdaBoost (AB) [26], Boosted Logistic Regression (BLR) [21,27], J48 [28], Cost-Sensitive C5.0 (C5.0 Cost) [29], Logistic Model Tree (LMT) [30], Multilayer Perceptron (MLP) [31], Support Vector Machines with Radial Basis Function Kernel (SVM Radial) [32], Partial Least Squares (PLS) [33], Boosted Tree (BT) [34] and Random Forest (RF) [35]. In order to compare the different ML techniques, we have employed the most common performance indicators detailed in literature.…”
Section: Methodsmentioning
confidence: 99%
“…Once obtained the labelled dataset, we have applied various ML techniques, previously selected among those that have already been used in the Software Engineering field, and, more in detail, in the software defect prediction problem, as presented in previous literature [25]: AdaBoost (AB) [26], Boosted Logistic Regression (BLR) [21,27], J48 [28], Cost-Sensitive C5.0 (C5.0 Cost) [29], Logistic Model Tree (LMT) [30], Multilayer Perceptron (MLP) [31], Support Vector Machines with Radial Basis Function Kernel (SVM Radial) [32], Partial Least Squares (PLS) [33], Boosted Tree (BT) [34] and Random Forest (RF) [35]. In order to compare the different ML techniques, we have employed the most common performance indicators detailed in literature.…”
Section: Methodsmentioning
confidence: 99%
“…Commonly used metrics can be generally divided into three categories: static code metrics, network metrics, and process metrics. Static code metrics measure the complexity of source code and assume that the more complex the source code is, the more likely defects are to appear [37]. Network metrics [38], which are effective for SDP, are social network analysis (SNA) metrics calculated based on the dependency graph of a software system and quantify the topological structure of each node of the dependency graph in a certain sense.…”
Section: Related Work a Software Defect Predictionmentioning
confidence: 99%
“…We provide all the classifiers the same data and we fixed the ratio to 0.3 for the training and testing sets. We use the confusion matrix shown in Table 4 to compute the performance of the classifiers [39]. This matrix provides the values of the following metrics such as accuracy, precision, recall, the F1-score expressed as follows:…”
Section: A Fraud Detection and Risk Measurementmentioning
confidence: 99%