2017
DOI: 10.14778/3115404.3115410
|View full text |Cite
|
Sign up to set email alerts
|

Time series data cleaning

Abstract: Errors are prevalent in time series data, such as GPS trajectories or sensor readings. Existing methods focus more on anomaly detection but not on repairing the detected anomalies. By simply filtering out the dirty data via anomaly detection, applications could still be unreliable over the incomplete time series. Instead of simply discarding anomalies, we propose to (iteratively) repair them in time series data, by creatively bonding the beauty of temporal nature in anomaly detection with the widely considered… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
14
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 99 publications
(21 citation statements)
references
References 20 publications
0
14
0
1
Order By: Relevance
“…To remove errors, data cleaning is required: defined as the "process of detecting, diagnosing, and editing faulty data" [6]. Ideally, data cleaning methods should prioritise data repair over data removal [7] and use computer programs to improve reproducibility [8].…”
Section: Introductionmentioning
confidence: 99%
“…To remove errors, data cleaning is required: defined as the "process of detecting, diagnosing, and editing faulty data" [6]. Ideally, data cleaning methods should prioritise data repair over data removal [7] and use computer programs to improve reproducibility [8].…”
Section: Introductionmentioning
confidence: 99%
“…(3) Consistency: The evaluation needs to assess whether the data items in a data set contradict with each other. It is often known as anomaly [46] or outlier [16] in time series data. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page.…”
Section: Introductionmentioning
confidence: 99%
“…Most of the existing faulty data detection techniques are designed for offline applications where the entire data set is available. Moreover, the methods which are proposed for online data cleaning are mainly effective for repairing spike errors [9].…”
Section: Introductionmentioning
confidence: 99%
“…Smoothing techniques are widely used for online data cleaning to eliminate noisy data [9]. Moving average (MA) [10], weighted moving average (WMA) [11], exponentially weighted moving average (EWMA) [11], and sliding window bottom-up (SWAB) [12] are examples of smoothing methods.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation