2006
DOI: 10.1007/11871637_10
|View full text |Cite
|
Sign up to set email alerts
|

Don’t Be Afraid of Simpler Patterns

Abstract: Abstract. This paper investigates the trade-off between the expressiveness of the pattern language and the performance of the pattern miner in structured data mining. This trade-off is investigated in the context of correlated pattern mining, which is concerned with finding the k-best patterns according to a convex criterion, for the pattern languages of itemsets, multi-itemsets, sequences, trees and graphs. The criteria used in our investigation are the typical ones in data mining: computational cost and pred… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
86
0

Year Published

2008
2008
2016
2016

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 48 publications
(86 citation statements)
references
References 13 publications
0
86
0
Order By: Relevance
“…The first approach generates frequent outerplanar subgraphs based on the BBP subgraph isomorphism using the mining algorithm FOG presented in [8]. The second approach generates frequent subgraphs based on the general subgraph isomorphism, using an efficient implementation [21] of the gSpan algorithm [12]. In both cases, the bit-vector encodes the occurrence of the frequent patterns.…”
Section: Methodsmentioning
confidence: 99%
“…The first approach generates frequent outerplanar subgraphs based on the BBP subgraph isomorphism using the mining algorithm FOG presented in [8]. The second approach generates frequent subgraphs based on the general subgraph isomorphism, using an efficient implementation [21] of the gSpan algorithm [12]. In both cases, the bit-vector encodes the occurrence of the frequent patterns.…”
Section: Methodsmentioning
confidence: 99%
“…a) E-mail: saigo@bio.kyutech.ac.jp DOI: 10.1587/transinf.E96.D.1766 More advanced approaches attempt to mine discriminative graphs by using the labels of examples as external information source to prune the search space [8]. Correlation or Information gain are typically employed to estimate the informativeness of patterns and prune uninteresting patterns.…”
Section: Introductionmentioning
confidence: 99%
“…Graph mining systems then perform a complete search through the entire graph space, enumerating all subgraphs satisfying these constraints (Yan and Han 2002;Bringmann et al 2006) or even exhaustively enumerating all possible subgraphs (Wale et al 2008).…”
Section: Introductionmentioning
confidence: 99%
“…While these approaches offer strong guarantees w.r.t. completeness or optimality of the found patterns, they have a high computational cost and require post-processing to deal, for example, with redundancy issues (Bringmann et al 2006). In this way, local pattern mining acts as a complex, expensive and indirect approach to generate features for graphs.…”
Section: Introductionmentioning
confidence: 99%