2021
DOI: 10.2991/ijcis.d.210212.001
|View full text |Cite
|
Sign up to set email alerts
|

Fuzzy Hoeffding Decision Tree for Data Stream Classification

Abstract: Data stream mining has recently grown in popularity, thanks to an increasing number of applications which need continuous and fast analysis of streaming data. Such data are generally produced in application domains that require immediate reactions with strict temporal constraints. These particular characteristics make problematic the use of classical machine learning algorithms for mining knowledge from these fast data streams and call for appropriate techniques. In this paper, based on the well-known Hoeffdin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(13 citation statements)
references
References 39 publications
0
11
0
Order By: Relevance
“…On the other hand, the second group of data sets was built based on real occupancy data set, which is available in the repository of the University of California in Irvine (UCI) [17]. There are three files in the directory downloaded from the mentioned source: one is a training data set and the two others are test data sets [21]. Due to the fact that in this study the k-fold cross validation method was used, those files were merged, and then based on that, the test and training sets were created.…”
Section: Evaluation: Platform Data Sets Methods and Metricsmentioning
confidence: 99%
“…On the other hand, the second group of data sets was built based on real occupancy data set, which is available in the repository of the University of California in Irvine (UCI) [17]. There are three files in the directory downloaded from the mentioned source: one is a training data set and the two others are test data sets [21]. Due to the fact that in this study the k-fold cross validation method was used, those files were merged, and then based on that, the test and training sets were created.…”
Section: Evaluation: Platform Data Sets Methods and Metricsmentioning
confidence: 99%
“…Moreover, fuzzy clustering has been preferred to hard clustering, due to its capability to better represent changes in data, which is a critical factor for stream data [1]. Indeed, for this reason, several extensions of fuzzy clustering algorithms have been proposed for data stream [10,17,27].…”
Section: Methodsmentioning
confidence: 99%
“…In this paper, we exploit two incremental decision trees suitable for data stream mining and classification, namely the Hoeffding Decision Tree (HDT) [19] and its fuzzy extension (Fuzzy Hoeffding Decision Tree -FHDT) introduced in [8] and deeply experimented in [9]. Although a tree is inherently interpretable, as stated above, fuzzy trees have been introduced because their usage of linguistic partitions on the attributes makes the resulting rules more explainable, given that each edge exiting a node can be associated with a proper and meaningful linguistic term.…”
Section: B Incremental Decision Treesmentioning
confidence: 99%
“…In fact, by using this strategy, only one rule is used to make a classification decision. More details on FHDTs can be found in [8] [9].…”
Section: B Incremental Decision Treesmentioning
confidence: 99%
See 1 more Smart Citation