2001
DOI: 10.1007/3-540-45329-6_4
|View full text |Cite
|
Sign up to set email alerts
|

Parallel Implementation of Decision Tree Learning Algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
9
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(9 citation statements)
references
References 1 publication
0
9
0
Order By: Relevance
“…For example, Ben-Haim et al proposed a new algorithm called streaming parallel decision tree (SPDT). It builds the decision tree using horizontal parallelism for large datasets [24,25].…”
Section: Related Workmentioning
confidence: 99%
“…For example, Ben-Haim et al proposed a new algorithm called streaming parallel decision tree (SPDT). It builds the decision tree using horizontal parallelism for large datasets [24,25].…”
Section: Related Workmentioning
confidence: 99%
“…We can categorize big data approaches to decision tree induction as follows: building one big tree (Andrzejak et al, 2013;Panda et al, 2009;Ntoutsi et al, 2008;Zhang and Jiang, 2012;Pawlik and Augsten, 2011;Narlikar, 1998;Sreenivas et al, 2000;Goil and Choudhary, 2001;Amado et al, 2001;Domingos and Hulten, 2000;Dai and Ji, 2014), transferring all decision trees into one rule base and back into a decision tree, ensemble approaches (Louppe and Geurts, 2012;Hansen and Salamon, 1990;Sollich and Krogh, 1996;Breiman, 1999), and others (e.g., Kargupta and Park, 2004) that do not build a new tree and use a combination of tree results. According to Ben-Haim and Tom-Tov (2010), another way to categorize the different types of algorithms for handling large datasets is to divide them into the following two groups: pre-sorting of data and using approximate representations of data.…”
Section: Background and Related Workmentioning
confidence: 99%
“…In this setting, the increasingly available parallel computing resources can be leveraged in two ways. First, proposals have been made to use these resources to improve the runtime performance of greedy search algorithms, i.e., to find the same results faster [2,6]. A second use of parallel computing resources that has been studied is to improve the task-based performance of greedy algorithms by addressing their shortcomings, namely their tendency to get caught in local extrema.…”
Section: Introductionmentioning
confidence: 99%