Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2015
DOI: 10.1145/2783258.2783286
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating Dynamic Time Warping Clustering with a Novel Admissible Pruning Strategy

Abstract: Clustering time series is a useful operation in its own right, and an important subroutine in many higher-level data mining analyses, including data editing for classifiers, summarization, and outlier detection. While it has been noted that the general superiority of Dynamic Time Warping (DTW) over Euclidean Distance for similarity search diminishes as we consider ever larger datasets, as we shall show, the same is not true for clustering. Thus, clustering time series under DTW remains a computationally challe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
72
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 93 publications
(77 citation statements)
references
References 28 publications
0
72
0
Order By: Relevance
“…For example, a benchmark dataset like StarLightCurves from the UCR Time Series Data Mining Archive 2 with 9236 time series, each of length 1024, is composed of 4.83e 9 subsequences. Real-world datasets tend to be orders of magnitude larger than this 3 . Performing similarity comparisons for all these subsequences is clearly impractical.…”
Section: Research Challenges and Limitations Ofmentioning
confidence: 96%
See 1 more Smart Citation
“…For example, a benchmark dataset like StarLightCurves from the UCR Time Series Data Mining Archive 2 with 9236 time series, each of length 1024, is composed of 4.83e 9 subsequences. Real-world datasets tend to be orders of magnitude larger than this 3 . Performing similarity comparisons for all these subsequences is clearly impractical.…”
Section: Research Challenges and Limitations Ofmentioning
confidence: 96%
“…[21] does data editing to make its classifier faster and more accurate, while we have to keep the original data to support accurate similarity searches and return actual sequence results. [3] introduces clusters constructed using DTW and a modified Density Peaks algorithm to improve the clustering performance. Using upper and lower bounds of DTW, TADPole can prune unnecessary computations.…”
Section: Related Workmentioning
confidence: 99%
“…Energies 2016, 9, 561 6 of 11 DTW utilizes dynamic programming to find the optimal mapping of points in two sequences and compute the distance between them. Detailed steps for computing the DTW distance can be found in several papers [20,21], and efficient algorithms can be found in [17].…”
Section: Dtw Distancementioning
confidence: 99%
“…The battery charge/discharge sequences are first denoised. The dynamic time warping (DTW) distances [17] between different battery charge/discharge sequences are then calculated, followed by a normalization step to form a similarity matrix, which is utilized by the affinity propagation (AP) algorithm [18] for clustering.…”
Section: Introductionmentioning
confidence: 99%
“…There are lots of different methods nowadays for solving the task. A special attention in clustering data arrays is given to issues related to time series processing, which usually imply (except for actual clustering) segmentation, detection of property changes (fault detection) and abnormal outliers [13]. Thereby, clustering should be performed not only in space, but also in time.…”
Section: Introductionmentioning
confidence: 99%