2020 IEEE 36th International Conference on Data Engineering (ICDE) 2020
DOI: 10.1109/icde48307.2020.00182
|View full text |Cite
|
Sign up to set email alerts
|

Automated Anomaly Detection in Large Sequences

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
13
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

3
5

Authors

Journals

citations
Cited by 46 publications
(15 citation statements)
references
References 17 publications
0
13
0
Order By: Relevance
“…Thus, for each method and condition, our results are based on a total of N × n t = 20K measurements. For all progressive methods, we test the accuracy of their estimates after the similarity search algorithm has visited 1 (2 0 ), 4 (2 2 ), 16 (2 4 ), 64 (2 6 ), 256 (2 8 ), and 1024 (2 10 ) leaves. Figure 7 shows the distributions of visited leaves for 100 random queries for all four datasets.…”
Section: Experimental Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, for each method and condition, our results are based on a total of N × n t = 20K measurements. For all progressive methods, we test the accuracy of their estimates after the similarity search algorithm has visited 1 (2 0 ), 4 (2 2 ), 16 (2 4 ), 64 (2 6 ), 256 (2 8 ), and 1024 (2 10 ) leaves. Figure 7 shows the distributions of visited leaves for 100 random queries for all four datasets.…”
Section: Experimental Evaluationmentioning
confidence: 99%
“…Data series analysis involves pattern matching [54,91], anomaly detection [10,11,17,24], frequent pattern mining [56,72], clustering [48,73,74,86], and classification [19]. These tasks rely on data series similarity.…”
Section: Introductionmentioning
confidence: 99%
“…An increasing number of applications across many diverse domains continuously produce very large amounts of data series 1 (such as in finance, environmental sciences, astrophysics, neuroscience, engineering, and others [1]- [3]), which makes them one of the most common types of data. When these sequence collections are generated (often times composed of a large number of short series [3], [4]), users need to query and analyze them (e.g., detect anomalies [5], [6]). This process is heavily dependent on data series similarity search (which apart from being a useful query in itself, also lies at the core of several machine learning methods, such as, clustering, classification, motif and outlier detection, etc.)…”
Section: Introductionmentioning
confidence: 99%
“…This process is heavily dependent on data series similarity search (which apart from being a useful query in itself, also lies at the core of several machine learning methods, such as, clustering, classification, motif and outlier detection, etc.) [8,9,15,44]. The brute-force approach for evaluating similarity search queries is by performing a sequential pass over the complete dataset.…”
mentioning
confidence: 99%