2019
DOI: 10.1145/3328746
|View full text |Cite
|
Sign up to set email alerts
|

Experience

Abstract: Data is a cornerstone of empirical software engineering (ESE) research and practice. Data underpin numerous process and project management activities, including the estimation of development effort and the prediction of the likely location and severity of defects in code. Serious questions have been raised, however, over the quality of the data used in ESE. Data quality problems caused by noise, outliers, and incompleteness have been noted as being especially prevalent. Other quality issues, although also pote… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 31 publications
(13 citation statements)
references
References 104 publications
(118 reference statements)
0
11
0
Order By: Relevance
“…This selection happens multiple time in each generation to fill the mating pool [65]. The tournament selector, on the other hand, is used to select the best n solutions (usually n ∈ [1,10]) to be copied straight into the next generation [89]. Each tournament randomly picks a number of solutions from the population and selects the fittest one.…”
Section: Confidence Guided Effort Estimation (Cogee)mentioning
confidence: 99%
See 3 more Smart Citations
“…This selection happens multiple time in each generation to fill the mating pool [65]. The tournament selector, on the other hand, is used to select the best n solutions (usually n ∈ [1,10]) to be copied straight into the next generation [89]. Each tournament randomly picks a number of solutions from the population and selects the fittest one.…”
Section: Confidence Guided Effort Estimation (Cogee)mentioning
confidence: 99%
“…We used the same five publicly available datasets used in the original study to empirically investigate our research questions. These five datasets, namely China, Desharnais, Finnish, Maxwell, and Miyazaki, have been extensively used in previous software effort estimation studies [10].…”
Section: Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…MacDonell et al [12] provided a transparent and consistent means of collection and evaluation of data that could lead to the use of higher quality data in software engineering experiments. They assessed the quality of 13 empirical software engineering datasets with the aim of benchmarking them against the data quality taxonomy proposed by Bosu and MacDonell [13] based on three groups: accuracy, relevance and provenance.…”
Section: Related Workmentioning
confidence: 99%