2019
DOI: 10.1007/s10618-019-00617-3
|View full text |Cite
|
Sign up to set email alerts
|

Proximity Forest: an effective and scalable distance-based classifier for time series

Abstract: Research into the classification of time series has made enormous progress in the last decade. The UCR time series archive has played a significant role in challenging and guiding the development of new learners for time series classification. The largest dataset in the UCR archive holds 10 thousand time series only; which may explain why the primary research focus has been on creating algorithms that have high accuracy on relatively small datasets.This paper introduces Proximity Forest, an algorithm that lear… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
88
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 128 publications
(88 citation statements)
references
References 46 publications
0
88
0
Order By: Relevance
“…BOSS has a training time complexity of O(n 2 · 2 ) [38]. Variations of BOSS such BOSS-VS [39] and WEASEL [41] were developed to be more scalable, but significantly sacrifice accuracy [31].…”
Section: Dictionary-based Techniquesmentioning
confidence: 99%
See 2 more Smart Citations
“…BOSS has a training time complexity of O(n 2 · 2 ) [38]. Variations of BOSS such BOSS-VS [39] and WEASEL [41] were developed to be more scalable, but significantly sacrifice accuracy [31].…”
Section: Dictionary-based Techniquesmentioning
confidence: 99%
“…Training time complexity Proximity Forest, on which TS-CHIEF builds, has average training time complexity that is quasi-linear with the quantity of training data, O(k · n log(n) · C e · c · 2 ) for k trees, n training time series of length , C e similarity-based candidate splits, and c classes [31]. The term k comes from the number of trees to train and log(n) from the average depth of the trees.…”
Section: Complexitymentioning
confidence: 99%
See 1 more Smart Citation
“…Proximity Forests (PFs) define the new state-of-the-art in distance-based time series classification [58]. Like RFs, PFs are a set of tree classifiers but instead of CART trees, proximity trees are used.…”
Section: Classification Modelsmentioning
confidence: 99%
“…Thirdly, the histograms can be very large, so storing them all for each base classifier also requires a significant memory commitment for large problems. One proposed method to solve this, the BOSS vector space (BOSS-VS) classifier [20], has been shown to be significantly less accurate than the full BOSS [21,18]. We investigate whether we can mitigate against these problems without this significant loss in accuracy.…”
Section: Introductionmentioning
confidence: 99%