2011
DOI: 10.1002/widm.51
|View full text |Cite
|
Sign up to set email alerts
|

Knowledge discovery in data streams with regression tree methods

Abstract: This paper presents an advanced review of regression tree methods for mining data streams. Batch regression tree methods are known for their simplicity, interpretability, accuracy, and efficiency. They use fast divide‐and‐conquer greedy algorithms that recursively partition the given training data into smaller subsets. The result is a tree‐shaped model with splitting rules in the internal nodes and predictions in the leaves. Most batch regression tree methods take a complete dataset and build a model using tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
18
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 35 publications
(18 citation statements)
references
References 42 publications
(58 reference statements)
0
18
0
Order By: Relevance
“…Finally, a recent article [Moreno-Torres et al 2012] focuses on describing various ways how data distribution can change over time and only briefly covers adaptation techniques from dataset shift community perspective, mostly leaving out works on concept drift. A recent review [Alberg et al 2012] focuses on decision trees.…”
Section: Introductionmentioning
confidence: 99%
“…Finally, a recent article [Moreno-Torres et al 2012] focuses on describing various ways how data distribution can change over time and only briefly covers adaptation techniques from dataset shift community perspective, mostly leaving out works on concept drift. A recent review [Alberg et al 2012] focuses on decision trees.…”
Section: Introductionmentioning
confidence: 99%
“…Most batch regression and tree models for predicting numerical variables, such as MARS [7], CART [3], RETIS [10], M5 [12], M5P [14], SMOTI [5], MAUVE [13], MOPT [1], GUIDE [11], and FIMT [9], are not designed for aggregated temporal data. Hence, they cannot utilize the relationship between multiple statistical moments (such as mean and standard deviation) of aggregated numerical attributes.…”
Section: Related Workmentioning
confidence: 99%
“…However, according to Breiman et al [4], these methods are known for their split instability. Finally, the interested reader may find a more detailed survey of regression tree methods in [1].…”
Section: Related Workmentioning
confidence: 99%
“…Incremental approaches, on the other hand, face the problem of dealing with continuous, and potentially computationally expensive, updates to the regression tree model. Recently, the hybrid approach of batch-incremental model building, where the model is updated using batches of new instances, typically using a sliding window, has become the most common approach [13].…”
Section: Related Workmentioning
confidence: 99%