2013 22nd Australian Software Engineering Conference 2013
DOI: 10.1109/aswec.2013.21
|View full text |Cite
|
Sign up to set email alerts
|

A Taxonomy of Data Quality Challenges in Empirical Software Engineering

Abstract: Abstract

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
29
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 32 publications
(29 citation statements)
references
References 63 publications
0
29
0
Order By: Relevance
“… Amount of data: the amount of data available for model building contributes to relevance in terms of goal attainment [7]; small and imbalanced data sets build inaccurate models.…”
Section: Journal Of Computersmentioning
confidence: 99%
See 3 more Smart Citations
“… Amount of data: the amount of data available for model building contributes to relevance in terms of goal attainment [7]; small and imbalanced data sets build inaccurate models.…”
Section: Journal Of Computersmentioning
confidence: 99%
“… Inconsistency: refers to a lack of harmony between different parts or elements; instances that are self-contradictory, or lacking in agreement when it is expected [7]. This problem is also known as mislabeled data or class noise.…”
Section: Data Quality Diagnosismentioning
confidence: 99%
See 2 more Smart Citations
“…In this paper we present a systematic review for data quality issues in knowledge discovery tasks as: heterogeneity, outliers, noise, inconsistency, incompleteness, amount of data, redundancy and timeliness which are defined in [7][8] and a case study in agricultural diseases: the coffee rust. This paper is organized as follows.…”
Section: Introductionmentioning
confidence: 99%