2020
DOI: 10.2197/ipsjjip.28.102
|View full text |Cite
|
Sign up to set email alerts
|

Recommendation of Imputing Value for Sensor Data based on Programming by Example

Abstract: Large volumes of data are typically used during analyses. Data preprocessing, which involves detecting outliers, handling missing data, data formatting, integration, and normalization, is essential for achieving accurate results. Many tools and methods are available for reducing preprocessing time. However, most analysts face difficulties when using them. This paper proposes a method for handling outliers and missing data, called Automated PRE-Processing for Sensor Data (APREP-S). For reducing analysis resourc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…While only one variable was missing half the data that was filled with linear regression, real data might have randomly distributed missing data for several variables. In that case, more sophisticated machine‐learning approaches are excellent for imputing missing data (Nagashima & Kato, 2020; Gruenwald et al., 2007).…”
Section: Discussionmentioning
confidence: 99%
“…While only one variable was missing half the data that was filled with linear regression, real data might have randomly distributed missing data for several variables. In that case, more sophisticated machine‐learning approaches are excellent for imputing missing data (Nagashima & Kato, 2020; Gruenwald et al., 2007).…”
Section: Discussionmentioning
confidence: 99%
“…We previously proposed "APREP-S," which conducts automatic pre-processing by integrating human knowledge with machine learning via the PBE approach [3], [4]. In this study, we enhance the previous APREP-S and propose a workflow to select site-specific features at each project site, and use clustering as the classification method.…”
Section: Proposed Methods -Aprep-smentioning
confidence: 99%
“…This led us to previously propose APREP-S (Automated Preprocessing for sensor data) [3], [4] to reduce the resources that need to be devoted to pre-processing. This is a method that integrates machine learning based on Bayesian inference with human 1 Tokyo Woman's Christian University, Suginami, Tokyo 167-8585, Japan a) h18m001@cis.twcu.ac.jp b) yuka@lab.twcu.ac.jp knowledge using programming by example (PBE) approach -details of the proposed APREP-S are provided in Section 3, and PBE approach is explained in Section 1.2.…”
Section: Introductionmentioning
confidence: 99%