2009
DOI: 10.1007/978-3-642-04180-8_15
|View full text |Cite
|
Sign up to set email alerts
|

On Subgroup Discovery in Numerical Domains

Abstract: Abstract. Subgroup discovery is a Knowledge Discovery task that aims at finding subgroups of a population with high generality and distributional unusualness. While several subgroup discovery algorithms have been presented in the past, they focus on databases with nominal attributes or make use of discretization to get rid of the numerical attributes. In this paper, we illustrate why the replacement of numerical attributes by nominal attributes can result in suboptimal results. Thereafter, we present a new sub… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
35
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(35 citation statements)
references
References 1 publication
(1 reference statement)
0
35
0
Order By: Relevance
“…Such restrictions usually compel the attribute domain to be nominal [1], [2], or impose an anti-monotonicity property on the quality measure which is then used to prune the search space [14]. We choose to free this paper from such restrictions, but notice that this is by no means essential to the described method.…”
Section: Definition (Subgroup)mentioning
confidence: 99%
“…Such restrictions usually compel the attribute domain to be nominal [1], [2], or impose an anti-monotonicity property on the quality measure which is then used to prune the search space [14]. We choose to free this paper from such restrictions, but notice that this is by no means essential to the described method.…”
Section: Definition (Subgroup)mentioning
confidence: 99%
“…(In fact, if statistical dependency were a monotonic property, correlated sets would capture also dependency rules, unless other extra heuristics were used.) In the context of classification rule search, there is even a globally optimal algorithm [16], which can handle small data sets, as long as the rule complexity is restricted.…”
Section: Handling Numerical and Periodic Variablesmentioning
confidence: 99%
“…Quality constraint is also one of the widely used pruning strategies for some subgroup discovery methods [22][23] . These methods prune the candidates that have a lower quality than a user-specified threshold value.…”
Section: Pruningmentioning
confidence: 99%