2019
DOI: 10.3390/app9132764
|View full text |Cite
|
Sign up to set email alerts
|

Performance Analysis of Feature Selection Methods in Software Defect Prediction: A Search Method Approach

Abstract: Software Defect Prediction (SDP) models are built using software metrics derived from software systems. The quality of SDP models depends largely on the quality of software metrics (dataset) used to build the SDP models. High dimensionality is one of the data quality problems that affect the performance of SDP models. Feature selection (FS) is a proven method for addressing the dimensionality problem. However, the choice of FS method for SDP is still a problem, as most of the empirical studies on FS methods fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

4
76
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 73 publications
(80 citation statements)
references
References 40 publications
4
76
0
Order By: Relevance
“…It has been reported by many researchers that in most of the datasets only few of the independent features can predict the target class effectively and remaining features don't only participate but can reduce the performance of classification model, if not removed. In this research, we have incorporated an aggregation based multi-filter feature selection technique, in which CFS [28,29,30] is used as attribute evaluator along with four widely used search methods including: GA, PSO, BFS, and FS. For each of the used dataset, feature selection is performed with all of these four search methods.…”
Section: Methodsmentioning
confidence: 99%
“…It has been reported by many researchers that in most of the datasets only few of the independent features can predict the target class effectively and remaining features don't only participate but can reduce the performance of classification model, if not removed. In this research, we have incorporated an aggregation based multi-filter feature selection technique, in which CFS [28,29,30] is used as attribute evaluator along with four widely used search methods including: GA, PSO, BFS, and FS. For each of the used dataset, feature selection is performed with all of these four search methods.…”
Section: Methodsmentioning
confidence: 99%
“…As, it has been proved now from the studies [2], [5] that those features should be removed from the dataset which do not participate in classification process as these features may reduce the performance. In this research, feature selection is performed by "Cfs Subset Evaluator" [28,29,30] with BestFirst search method, whereas full dataset is given for training. In this approach, three directions are used for feature subset selection: Forward, Backward, and Bi-Directional.…”
Section: Methodsmentioning
confidence: 99%
“…Feature selection is a dimensionality reduction technique that involves selecting relevant and irredundant features from a dataset [24]. In this study, a filter-based feature selection method, Chi-Square (CS), was incorporated to reduce the number of features used in training the model.…”
Section: Feature Extractionmentioning
confidence: 99%