2021
DOI: 10.21655/ijsi.1673-7288.00239
|View full text |Cite
|
Sign up to set email alerts
|

Time Series Data Cleaning under Multi-Speed Constraints

Abstract: As the basis of data management and analysis, data quality issues have increasingly become a research hotspot in related fields, which contributes to optimization of big data and artificial intelligence technology. Generally, physical failures or technical defects in data collectors and recorders cause anomalies in collected data. These anomalies will strongly impact on subsequent data analysis and artificial intelligence processes; thus, data should be processed and cleaned accordingly before application. Exi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 20 publications
(23 reference statements)
0
4
0
Order By: Relevance
“…Due to the strict equality relationships required by functional dependencies and rule constraints, it is difficult to discover absolute consistency in real time series datasets. To this end, Gao et al [27] proposed a time series restoration method based on multi-interval velocity constraints. Related studies indicate that, for the restoration issues pertaining to time series data, having accurate timestamps ensures satisfactory restoration effects.…”
Section: Related Workmentioning
confidence: 99%
“…Due to the strict equality relationships required by functional dependencies and rule constraints, it is difficult to discover absolute consistency in real time series datasets. To this end, Gao et al [27] proposed a time series restoration method based on multi-interval velocity constraints. Related studies indicate that, for the restoration issues pertaining to time series data, having accurate timestamps ensures satisfactory restoration effects.…”
Section: Related Workmentioning
confidence: 99%
“…Since only a single attribute variable is involved, the speed constraint based repairing method on a single sequence can be converted to a linear programming problem for solving, compared to the NP time complexity of conditional function-dependent and denial-constrained restoration algorithms, which is earlier research on rule-based data cleaning. On this basis, in 2018, Yin et al [74] proposed data cleaning method combining variance constraints with speed constraints. In 2021, Gao et al [75] proposed a repair method with multi-interval speed constraints, and Song et al [42] proposed a repair method combining speed constraints and acceleration constraints, which improved the practicality of multi-entity singleattribute rules.…”
Section: Rule-based Error Repairmentioning
confidence: 99%
“…Repair methods based on velocity constraints [41,73] ; repair methods combining variance constraints with velocity constraints [74] .…”
Section: Single-entity Multiattributementioning
confidence: 99%
“…Li et al [7] used an improved A * shortest path algorithm to fully consider road network topology and historical matching points and proposed a new trajectory restoration algorithm. Gao et al [8] used a dynamic programming method to set multiple intervals for sequence data, and searched for candidate repair points in an iterative manner, avoiding excessive repair of sequence data.…”
Section: Related Workmentioning
confidence: 99%
“…Between January 28 and February 1, 2021, 624,307 AIS data points from 522 vessels were selected as experimental data. If there are at least MinPts points in ϵ field of p i , then// e mercator distance between two points can be found in the M (8) Initialize C temp � ∅, add p to C temp (9) Let N be the points set in the ϵ field of p i (10) For each p i ′ in N (11) If p i ′ is unvisited, then (12) Mark p i ′ as visited (13) If there are at least MinPts points in the ϵ field of p i ′ , then (14) Add points to N (15) End If (16) If p i ′ is not a member of any cluster, then (17) Add p i ′ to C temp (18) End If (19) End If (20) End For (21) Add C temp to C (22) Else mark p i as noise point (23) End If (24) Until all the points are marked, C � C 1 , C 2 , . .…”
Section: Experimental Environment and Datasetmentioning
confidence: 99%