2017
DOI: 10.1007/s10618-017-0547-5
|View full text |Cite
|
Sign up to set email alerts
|

Anytime discovery of a diverse set of patterns with Monte Carlo tree search

Abstract: The discovery of patterns that accurately discriminate one class label from another remains a challenging data mining task. Subgroup discovery (SD) is one of the frameworks that enables to elicit such interesting patterns from labeled data. A question remains fairly open: How to select an accurate heuristic search technique when exhaustive enumeration of the pattern space is infeasible? Existing approaches make use of beam-search, sampling, and genetic algorithms for discovering a pattern set that is nonredund… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
49
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
6

Relationship

3
3

Authors

Journals

citations
Cited by 28 publications
(49 citation statements)
references
References 52 publications
0
49
0
Order By: Relevance
“…This may come from the fact that deep patterns are not diverse enough to be selected by the post-processing step which in turn suggests that, when the timeout is reached, the diversity of the mined pattern is not high enough. To further increase this diversity, stochastic search methods such as Monte-Carlo Tree Search [2] could be integrated in our algorithm.…”
Section: Methodsmentioning
confidence: 99%
“…This may come from the fact that deep patterns are not diverse enough to be selected by the post-processing step which in turn suggests that, when the timeout is reached, the diversity of the mined pattern is not high enough. To further increase this diversity, stochastic search methods such as Monte-Carlo Tree Search [2] could be integrated in our algorithm.…”
Section: Methodsmentioning
confidence: 99%
“…For instance, in Fig. 1, the domain of m 1 is {1, 2, 3, 4} and the intent [2,4], [1,3] (see the definition of interval patterns later) denotes a subgroup whose extent is {g 3 , g 4 , g 5 , g 6 }.…”
Section: Problem Definitionmentioning
confidence: 99%
“…Heuristic [2,15] and exhaustive [1,5] solutions have been proposed for subgroup discovery. Usually, these approaches consider a set of nominal attributes with a binary label.…”
Section: Related Workmentioning
confidence: 99%
“…A quality measure has to capture discrepancies in the target label distribution between the selected subset of objects and the overall dataset. A large panel of exhaustive [1,10] and heuristic [5,16] subgroup discovery algorithms have been proposed so far. Most of these approaches consider a set of nominal attributes with a binary label.…”
Section: Introductionmentioning
confidence: 99%