Proceedings of the 2016 International Conference on Management of Data 2016
DOI: 10.1145/2882903.2912574
|View full text |Cite
|
Sign up to set email alerts
|

Data Cleaning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
46
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 302 publications
(69 citation statements)
references
References 63 publications
0
46
0
Order By: Relevance
“…Following the decoding interview proceedings, a narrative data analysis approach as suggested by Creswell and Creswell (2017) was used to transcribe the audio recordings made during the decoding interview sessions and to analyse the data. After transcribing the data, it was cleansed by searching for faults and repairing them (Chu et al, 2016). As the discussions were open-ended, the transcripts also contained some illogical and repeated statements.…”
Section: Discussionmentioning
confidence: 99%
“…Following the decoding interview proceedings, a narrative data analysis approach as suggested by Creswell and Creswell (2017) was used to transcribe the audio recordings made during the decoding interview sessions and to analyse the data. After transcribing the data, it was cleansed by searching for faults and repairing them (Chu et al, 2016). As the discussions were open-ended, the transcripts also contained some illogical and repeated statements.…”
Section: Discussionmentioning
confidence: 99%
“…Data cleaning techniques enable the detection and repair of data errors by identifying integrity constraints, duplicates, functional dependencies, and so on. These activities can be automatic or human-guided, and error detection techniques can be applied on the original database or later on in the data processing pipeline [13]. In this paper, the discovery of errors after some processing and integration is of particular importance given the use of large-scale cross comparisons and the integration of external sources.…”
Section: G Scalia Et Al / Towards a Scientific Data Framework To Sumentioning
confidence: 99%
“…The ChemKED format is supported through the PyKED utility (https://github.com/pr-omethe-us/PyKED) which allow to convert a ChemKED file in an equivalent ReSpecTh file 13. An experiment is interpretable by the framework if its definition has been added to the domain knowledge.…”
mentioning
confidence: 99%
“…After completing the data acquisition and collection, data cleaning checked the resulting dataset to identify and correct possible errors such as missing values, outlier values or different data formats (Chu, Ilyas, Krishnan, & Wang, 2016). This guarantees the highest degree of data reliability.…”
Section: Data Preparationmentioning
confidence: 99%