2014
DOI: 10.1007/s10489-014-0553-x
|View full text |Cite
|
Sign up to set email alerts
|

Analyzing very large time series using suffix arrays

Abstract: Suffix arrays form a powerful data structure for pattern detection and matching. In a previous work, we presented a novel algorithm (COV) which is the only algorithm that allows the detection of all repeated patterns in a time series by using the actual suffix array. However, the requirements for storing the actual suffix strings even on external media makes the use of suffix arrays impossible for very large time series. We have already proved that using the concept of Longest Expected Repeated Pattern (LERP) … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 30 publications
(14 citation statements)
references
References 27 publications
0
14
0
Order By: Relevance
“…The protocol can be utilized in emerging cloud computing infrastructure where timestamp based privacy preserving data can be classified in a cloud among clients and server. In Xylogiannopoulos et al (2014), authors proposed COV method that utilizes suffix arrays, LERP method that allows actual suffixes stored in linear capacity of external media and MLERP methods that analyze very large time series by maximal utilization of available hardware.…”
Section: Background Study and Related Workmentioning
confidence: 99%
“…The protocol can be utilized in emerging cloud computing infrastructure where timestamp based privacy preserving data can be classified in a cloud among clients and server. In Xylogiannopoulos et al (2014), authors proposed COV method that utilizes suffix arrays, LERP method that allows actual suffixes stored in linear capacity of external media and MLERP methods that analyze very large time series by maximal utilization of available hardware.…”
Section: Background Study and Related Workmentioning
confidence: 99%
“…As regards perspective (i), the work in [40], for instance, introduces a tool for detecting all repeated patterns in a time series regardless of its size and hardware limitations. It focuses on the problem of repeated pattern detection in a single time series, and extends existing approaches to deal with larger data.…”
Section: Related Workmentioning
confidence: 99%
“…The method proposed in this paper is based on the Suffix Array data structure that is used to detect all repeated patterns in a sequence. More specifically, the ARPaD Algorithm [19] is used as it has been derived by COV Algorithm [15], [16], [17]. A Suffix Array is a data structure that contains an array of all suffixes of a string [20] and it is mainly used for pattern detection.…”
Section: Our Approachmentioning
confidence: 99%
“…This is needed for the ARPaD Algorithm in order to perform the analysis as the strings have directly come from a Suffix Array data structure. This is the most time consuming part since it has complexity O(nlogn), while ARPaD Algorithm has been proven experimentally to have on average complexity O(n) [16], [18], [19]. Therefore, the total complexity of the method is on average O(nlogn) which allows a very fast analysis of the IP addresses data.…”
Section: Our Approachmentioning
confidence: 99%
See 1 more Smart Citation