2013
DOI: 10.1007/s10115-012-0606-6
|View full text |Cite
|
Sign up to set email alerts
|

Beyond one billion time series: indexing and mining very large time series collections with $$i$$ SAX2+

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
152
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 89 publications
(152 citation statements)
references
References 25 publications
0
152
0
Order By: Relevance
“…Most methods are based on longest common sub-sequence algorithm [17], [18]. However, these methods are not ideal for IoT/M2M data for two main reasons.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Most methods are based on longest common sub-sequence algorithm [17], [18]. However, these methods are not ideal for IoT/M2M data for two main reasons.…”
Section: Background and Related Workmentioning
confidence: 99%
“…For instance, iSAX requires more than 6 days to index 100 million (10 8 ) time-series data [Camerra et al 2010]. However, [Camerra et al 2014] argue that it requires two days to build the same data size and 20 days to build 500 million (5 × 10 8 ) time-series data. iSAX requires a long time to build indexes because two main reasons: a) Ine cient splitting policy b) No bulk loading scheme.…”
Section: Symbolic Data Indexing Approachmentioning
confidence: 99%
“…However, similar to SAX, determining iSAX parameters relies heavily on the data. Moreover, once the root's and the child nodes' representations are constructed, it is not possible to update them [Camerra et al 2014], which is a constraint; especially if we consider using iSAX in indexing on-line time-series data which requires the indexing mechanism to be continuously updated with no prior knowledge of the data size. Intuitively, iSAX does not allow the child nodes to be represented by a higher cardinality once they are created.…”
Section: Thematic Datamentioning
confidence: 99%
“…At the same time we have witnessed an increased interest in data series management and processing [32,23,22,9], related to data produced by sensors, or scientific experiments.…”
Section: Related Workmentioning
confidence: 99%
“…Let D be a dataset with N = 4 and m = 3. Let S1, S2, S3 and S4 be the instantiated distance partitions: S1 = {[2, 2] : 0.33, [4,4] : 0.33, [6,6] : 0.33}, S2 = { [4,8] : 1}, S3 = {[1, 1] : 0.33, [5,5] : 0.33, [9,9] : 0.33} and S4 = {[1, 1] : 0.33, [3,3] : 0.33, [7,7] : 0.33}. The PNN probability estimates determined using the Eq.4 and Eq.…”
Section: Lemma 2 (Dependencies In Distance Partitions)mentioning
confidence: 99%