Proceedings of the 7th International Conference on Predictive Models in Software Engineering 2011
DOI: 10.1145/2020390.2020407
|View full text |Cite
|
Sign up to set email alerts
|

An empirical evaluation of outlier deletion methods for analogy-based cost estimation

Abstract: Background: Any software project dataset sometimes includes outliers which affect the accuracy of effort estimation. Outlier deletion methods are often used to eliminate them. However, there are few case studies which apply outlier deletion methods to analogy-based estimation, so it is not clear which method is more suitable for analogy-based estimation. Aim: Clarifying the effects of existing outlier deletion methods (Cook's distance based deletion, LTS based deletion, k-means based deletion, Mantel's correla… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 25 publications
0
8
0
Order By: Relevance
“…The combination of scaling scheme and missing-data treatment (MDT) is firstly analyzed; however, their impacts onto the ML method were not studied. Many studies propose one or more DP techniques to deal with a specific issue in SEC, such as data missingness [30][31][32], redundant or irrelevant features [13,28], or abnormal cases [33,34]. But they did not study the effectiveness of different DP techniques.…”
Section: Parametric Modelsmentioning
confidence: 99%
See 3 more Smart Citations
“…The combination of scaling scheme and missing-data treatment (MDT) is firstly analyzed; however, their impacts onto the ML method were not studied. Many studies propose one or more DP techniques to deal with a specific issue in SEC, such as data missingness [30][31][32], redundant or irrelevant features [13,28], or abnormal cases [33,34]. But they did not study the effectiveness of different DP techniques.…”
Section: Parametric Modelsmentioning
confidence: 99%
“…Table 1 also shows that many studies use combined DPs. For examples, there are 7 of totally 48 works combined only scaling and FS/CS [20,27,34,[52][53][54][55], and 7 of 48 works combined only MDTs and FS/CS [19,23,36,[56][57][58][59]. FS and CS have been considered as a necessary step for CBR in several studies [6,14,23,26,33,37,47,48,52,[55][56][57][60][61][62][63].…”
Section: 1mentioning
confidence: 99%
See 2 more Smart Citations
“…This step deals with two imposing problems in data collection, namely, missing value and outlier detection [22]. Missing value is a common phenomenon in project management where numerous small activities, assignments, and work products are too trivial, hard to identify or account for, or due to human errors.…”
Section: Missing Value and Outliermentioning
confidence: 99%