2017 5th Intl Conf on Applied Computing and Information Technology/4th Intl Conf on Computational Science/Intelligence and Appl 2017
DOI: 10.1109/acit-csii-bcd.2017.49
|View full text |Cite
|
Sign up to set email alerts
|

A Survey on Big Data Pre-processing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 61 publications
0
6
0
Order By: Relevance
“…On the other hand, the data series provided by government database can be incomplete or erroneous because of unreliable transmission. Data preparation in these contexts can be generally classified into three main groups, namely data cleansing, integration, and reduction as investigated in [111]. In FADE, cleansing is envisaged to deal with incompleteness and erroneous conditions.…”
Section: Data Preparationmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, the data series provided by government database can be incomplete or erroneous because of unreliable transmission. Data preparation in these contexts can be generally classified into three main groups, namely data cleansing, integration, and reduction as investigated in [111]. In FADE, cleansing is envisaged to deal with incompleteness and erroneous conditions.…”
Section: Data Preparationmentioning
confidence: 99%
“…The data will then be shared before combined or summarized. The types of textual data collected are not necessarily identical because this may also include data integration [111]. However, preprocessing applied to the collected data requires computation resources if performed individually and locally.…”
Section: Peer-offloading Data Preparationmentioning
confidence: 99%
“…Data was cleaned using tokenization, with stemming and lemmatization [12]. Noise such as hashtags, punctuation, special characters, reluctant words, along with other irrelevant data, were removed.…”
Section: Data Pre-processingmentioning
confidence: 99%
“…Data preparation or also known as data pre-processing is a vital process in majority of machine learning projects and activities [1]. Depending on the nature of data, there are many methods of data pre-processing that can be applied and data discretization is one of those methods [2].…”
Section: Introductionmentioning
confidence: 99%