2018
DOI: 10.20944/preprints201805.0248.v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Rough Noise-Filtered Easy Ensemble for Software Fault Prediction

Abstract: Software fault prediction is the very consequent research topic for software quality assurance. Data driven approaches provide robust mechanisms to deal with software fault prediction. However, the prediction performance of the model highly depends on the quality of dataset. Many software datasets suffers from the problem of class imbalance. In this regard, under-sampling is a popular data pre-processing method in dealing with class imbalance problem, Easy Ensemble (EE) present a robust approach to achieve a h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 46 publications
0
2
0
Order By: Relevance
“…Catal et al [50] conducted a study over class and noise detection; they proposed a detection algorithm based on software feature threshold values. Riazz et al [51] proposed a two-stage data preprocessing methods that incorporates the feature selection and noise filter execution; they employed Knearest neighbor and ensemble learning in their proposed approach. Alan et al [52] proposed an outlier detection approach using metrics threshold and class label; they employed NASA datasets to identify class outliers; they found the proposed model outperforms over baseline methods.…”
Section: Quality Of Defect Datasetmentioning
confidence: 99%
“…Catal et al [50] conducted a study over class and noise detection; they proposed a detection algorithm based on software feature threshold values. Riazz et al [51] proposed a two-stage data preprocessing methods that incorporates the feature selection and noise filter execution; they employed Knearest neighbor and ensemble learning in their proposed approach. Alan et al [52] proposed an outlier detection approach using metrics threshold and class label; they employed NASA datasets to identify class outliers; they found the proposed model outperforms over baseline methods.…”
Section: Quality Of Defect Datasetmentioning
confidence: 99%
“…However, Multi-class imbalanced data is encountered in real-world applications [16]- [18], the problem of overlapping between the distinct classes in datasets make a more challenging problem than binary imbalanced learning. In the recent study, to address the multiclass imbalanced data issue for classification either adopt ensemble-based approaches [19], [20], [21], [71] decomposition strategies [22], in which the multi-class imbalanced data is divided into several binary class subsets, which is easyto-solve.…”
Section: Introductionmentioning
confidence: 99%