2018
DOI: 10.1002/smr.2114
|View full text |Cite
|
Sign up to set email alerts
|

Support vector regression‐based imputation in analogy‐based software development effort estimation

Abstract: Missing data (MD) is a widespread problem that can affect the ability to use data to construct effective software development effort estimation (SDEE) techniques. To deal with this challenge, several imputation techniques have been investigated in SDEE and k‐nearest neighbors (KNN)‐based imputation is still the most frequently used. To the best of our knowledge, no study has used support vector regression (SVR)‐based imputation to construct accurate estimation techniques, in particular those based on analogy. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
18
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2

Relationship

4
2

Authors

Journals

citations
Cited by 17 publications
(18 citation statements)
references
References 66 publications
0
18
0
Order By: Relevance
“…• Parameter of fuzziness m controls the extent of sharing among fuzzy clusters. 24,48 In other words, low values of m mean that the project is more likely to belong to one cluster while high values mean that the project is more likely to belong to more clusters. Based on our previous study, 24 we range the parameter m from 1.5 to 3.5 with increments of 0.5.…”
Section: Empirical Designmentioning
confidence: 99%
See 2 more Smart Citations
“…• Parameter of fuzziness m controls the extent of sharing among fuzzy clusters. 24,48 In other words, low values of m mean that the project is more likely to belong to one cluster while high values mean that the project is more likely to belong to more clusters. Based on our previous study, 24 we range the parameter m from 1.5 to 3.5 with increments of 0.5.…”
Section: Empirical Designmentioning
confidence: 99%
“…49 The most commonly used approach to decide the optimal cluster number is executing the clustering algorithm several times with a different number of clusters and then selecting the cluster number that provides the best result according to a predefined criterion. 48 The predefined criterion function is called the cluster validity index.…”
Section: Empirical Designmentioning
confidence: 99%
See 1 more Smart Citation
“…In this framework, the observed data are considered as a training set for the learning model, which is then applied to the data with missing values to impute. K-Nearest Neighbor (KNN) [ 17 ], Decision Tree (DT) [ 18 ] and Support Vector Regression (SVR) [ 19 ] are the most used ML techniques for imputation and achieved great success [ 20 ].…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, feature subset selection techniques should be applied to find the optimal set of features . Another important limitation of ASEE techniques is their inability to handle missing values …”
Section: Introductionmentioning
confidence: 99%