2008
DOI: 10.1109/tkde.2007.190695
|View full text |Cite
|
Sign up to set email alerts
|

DryadeParent, An Efficient and Robust Closed Attribute Tree Mining Algorithm

Abstract: In this paper, we present a new tree mining algorithm, DRYADEPARENT, based on the hooking principle first introduced in DRYADE. In the experiments, we demonstrate that the branching factor and depth of the frequent patterns to find are key factors of complexity for tree mining algorithms, even if often overlooked in previous work. We show that DRYADEPARENT outperforms the current fastest algorithm, CMTreeMiner, by orders of magnitude on data sets where the frequent tree patterns have a high branching factor.In… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Year Published

2009
2009
2020
2020

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 40 publications
(28 citation statements)
references
References 23 publications
0
28
0
Order By: Relevance
“…In this paper, we mainly studied a group of pattern classes called rigid patterns [3,4,6,7,12,17,18,22], with closure mappings C. On the other hand, although classes of flexible patterns [5,28,27] have no notion of closure mappings C, they still allow efficient depth-first search. Thus, it an interesting future problem to extend the approach of this paper to these classes.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In this paper, we mainly studied a group of pattern classes called rigid patterns [3,4,6,7,12,17,18,22], with closure mappings C. On the other hand, although classes of flexible patterns [5,28,27] have no notion of closure mappings C, they still allow efficient depth-first search. Thus, it an interesting future problem to extend the approach of this paper to these classes.…”
Section: Resultsmentioning
confidence: 99%
“…The problem of closed set mining (CIM) is one of the most extensively studied topics for years that attracts many researchers from both practical and theoretical views [3,4,5,6,7,11,12,17,18,22,25,26,28,27,31]. A closed itemset (a closed sets, for short) is a representative among an equivalence class of itemsets that have the same set of occurrences in common in a given transaction database D. The collection C of all closed sets in D contains the same information as the original collection F of frequent itemsets, while the size of C is possibly much smaller than F. Hence, from the engineering point of view, CIM is considered useful for increasing the comprehensibility and the efficiency of FIM [26,31].…”
Section: Closed Pattern Mining Frequent Itemset Miningmentioning
confidence: 99%
See 1 more Smart Citation
“…In the database literature, it is possible to find many proposals of algorithms to extract frequent structures both from graph-based data representations [36], [18], [15] and tree-based data representations [37], [35], [2], [25], [26], [17], [16], [23], [17]. In this paper, we focus on tree mining since XML documents are represented with a treeshaped structure.…”
Section: Comparison With Other Workmentioning
confidence: 99%
“…More recently, in order to deal with semi-structured data, more expressive patterns than itemsets have been heavily investigated [7,5]. Again closure operators have been used to efficiently extract closed patterns, as for instance attribute trees [1,11]. Still the number of closed patterns is often too high when dealing with realworld data, and some way to select them has to be found [6].…”
Section: Introductionmentioning
confidence: 99%