2008
DOI: 10.1007/s10115-008-0136-4
|View full text |Cite
|
Sign up to set email alerts
|

One in a million: picking the right patterns

Abstract: Constrained pattern mining extracts patterns based on their individual merit. Usually this results in far more patterns than a human expert or a machine leaning technique could make use of. Often different patterns or combinations of patterns cover a similar subset of the examples, thus being redundant and not carrying any new information. To remove the redundant information contained in such pattern sets, we propose two general heuristic algorithms-Bouncer and Picker-for selecting a small subset of patterns. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
30
0

Year Published

2009
2009
2019
2019

Publication Types

Select...
3
3
3

Relationship

2
7

Authors

Journals

citations
Cited by 33 publications
(30 citation statements)
references
References 13 publications
0
30
0
Order By: Relevance
“…Another exciting question is whether our results on the optimality of supervised feature selection can be transfered to techniques for unsupervised feature selection on frequent subgraphs [5]. We are positive that this is possible (S. Nijssen, personal communication (2008)).…”
Section: Discussionmentioning
confidence: 92%
“…Another exciting question is whether our results on the optimality of supervised feature selection can be transfered to techniques for unsupervised feature selection on frequent subgraphs [5]. We are positive that this is possible (S. Nijssen, personal communication (2008)).…”
Section: Discussionmentioning
confidence: 92%
“…Top-K most similar pattern use for the pattern-based classification is a common strategy [1,14]. Other strategies, such as, similarity between patterns [2] or emerging patterns [4], are used for the optimal pattern selection. In this paper we define similarity metric for pattern and context which incorporates hierarchical attribute structure.…”
Section: Related Workmentioning
confidence: 99%
“…Pre-pruning relational data has been a very promising research topic (Bringmann and Zimmermann, 2009). Cohen (1995) introduced a method to filter irrelevant literals out of relational examples in a text mining context.…”
Section: Related Workmentioning
confidence: 99%