Machine Learning Techniques for Software Bug Prediction: A Systematic Review

Saharudin, Syahana Nur’Ain; Koh, Tieng Wei; Kew, Si Na

doi:10.3844/jcssp.2020.1558.1569

Cited by 15 publications

(8 citation statements)

References 40 publications

(48 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The most widely used data sets in SDP are the Predictor Models in Software Engineering (PROMISE), and NASA Metrics Data Program (MDP)m according to Saharudin et al [ 6 ]. It was observed that 43.3% of each adopted data set was considered in research experiments, while in total usage, 86.6% was due to the open-source nature.…”

Section: Methodsmentioning

confidence: 99%

“…Performance metrics are important indicators to measure and assess the quality of ML models. Saharudin et al [ 6 ] found that, for SDP, the most widely included types of numerical quantification measurements are Area Under Curve (AUC), based on the results of the Receiver Operating Characteristic (ROC) curve, hqving 56.7%, Recall, with 46.7%, F-Measure/F1-Measure, with 36.7%, Precision, with 30%, Accuracy, with 26.7%, and Other numerical measurements with 76.7%.…”

Section: Methodsmentioning

confidence: 99%

“…Saharudin et al [ 6 ] found that defects can occur at any stage during the development process, possibly remaining hidden and only becoming active at deployment. This has many real-world consequences or drawbacks, as ever-evolving software becomes more integrated into many aspects of our daily lives.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications

Mcmurray

Sodhro

2023

Sensors

View full text Add to dashboard Cite

Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these systems increasing, the ability to accurately identify a defective model using Machine Learning (ML) has been overlooked and less addressed. Thus, this article contributes an investigation of various ML techniques for SDP. An investigation, comparative analysis and recommendation of appropriate Feature Extraction (FE) techniques, Principal Component Analysis (PCA), Partial Least Squares Regression (PLS), Feature Selection (FS) techniques, Fisher score, Recursive Feature Elimination (RFE), and Elastic Net are presented. Validation of the following techniques, both separately and in combination with ML algorithms, is performed: Support Vector Machine (SVM), Logistic Regression (LR), Naïve Bayes (NB), K-Nearest Neighbour (KNN), Multilayer Perceptron (MLP), Decision Tree (DT), and ensemble learning methods Bootstrap Aggregation (Bagging), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Random Forest(RF), and Generalized Stacking (Stacking). Extensive experimental setup was built and the results of the experiments revealed that FE and FS can both positively and negatively affect performance over the base model or Baseline. PLS, both separately and in combination with FS techniques, provides impressive, and the most consistent, improvements, while PCA, in combination with Elastic-Net, shows acceptable improvement.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications

Mcmurray

Sodhro

2023

Sensors

View full text Add to dashboard Cite

show abstract

“…Metrics can also be classified based on development phase of software life cycle, into source code level metrics, detailed design level metrics or test level metrics. Objectoriented metrics are often used to assess the testability, maintainability or reusability of source code [20], [35]. Commonly dataset that used for software bug prediction domain is promise repository dataset.…”

Section: Software Metrics (Features) and Datasetsmentioning

confidence: 99%

Comprehensive Study on Machine Learning Techniques for Software Bug Prediction

Khleel¹,

Nehéz²

2021

IJACSA

View full text Add to dashboard Cite

Software bugs are defects or faults in computer programs or systems that cause incorrect or unexpected operations. These negatively affect software quality, reliability, and maintenance cost; therefore many researchers have already built and developed several models for software bug prediction. Till now, a few works have been done which used machine learning techniques for software bug prediction. The aim of this paper is to present comprehensive study on machine learning techniques that were successfully used to predict software bug. Paper also presents a software bug prediction model based on supervised machine learning algorithms are Decision Tree (DT), Naïve Bayes (NB), Random Forest (RF) and Logistic Regression (LR) on four datasets. We compared the results of our proposed models with those of the other studies. The results of this study demonstrated that our proposed models performed better than other models that used the same data sets. The evaluation process and the results of the study show that machine learning algorithms can be used effectively for prediction of bugs.

show abstract

“…During the life cycle's development phase, which also includes planning, deployment, design, testing, problem assessment, development, along with continuation, as well as software development life cycle models, machine learning techniques are used to anticipate software bugs. Machine learning techniques and statistical analysis may both be used to anticipate bugs (Saharudin, S.N et al, 2020) [3]. During the software development process, many approaches are employed to get better excellence.…”

Section: Introductionmentioning

confidence: 99%

Moth Flame Optimization Based FCNN for Prediction of Bugs in Software

Anjali¹,

Dhas²,

Singh³

2023

Intelligent Automation &Amp; Soft Computing

View full text Add to dashboard Cite

The software engineering technique makes it possible to create highquality software. One of the most significant qualities of good software is that it is devoid of bugs. One of the most time-consuming and costly software procedures is finding and fixing bugs. Although it is impossible to eradicate all bugs, it is feasible to reduce the number of bugs and their negative effects. To broaden the scope of bug prediction techniques and increase software quality, numerous causes of software problems must be identified, and successful bug prediction models must be implemented. This study employs a hybrid of Faster Convolution Neural Network and the Moth Flame Optimization (MFO) algorithm to forecast the number of bugs in software based on the program data itself, such as the line quantity in codes, methods characteristics, and other essential software aspects. Here, the MFO method is used to train the neural network to identify optimal weights. The proposed MFO-FCNN technique is compared with existing methods such as AdaBoost (AB), Random Forest (RF), K-Nearest Neighbour (KNN), K-Means Clustering (KMC), Support Vector Machine (SVM) and Bagging Classifier (BC) are examples of machine learning (ML) techniques. The assessment method revealed that machine learning techniques may be employed successfully and through a high level of accuracy. The obtained data revealed that the proposed strategy outperforms the traditional approach.

show abstract

Machine Learning Techniques for Software Bug Prediction: A Systematic Review

Cited by 15 publications

References 40 publications

A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications

A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications

Comprehensive Study on Machine Learning Techniques for Software Bug Prediction

Moth Flame Optimization Based FCNN for Prediction of Bugs in Software

Contact Info

Product

Resources

About