2009
DOI: 10.1007/s10618-009-0136-3
|View full text |Cite
|
Sign up to set email alerts
|

On subgroup discovery in numerical domains

Abstract: Subgroup discovery is a Knowledge Discovery task that aims at finding subgroups of a population with high generality and distribu-tional unusualness. While several subgroup discovery algorithms have been presented in the past, they focus on databases with nominal attributes or make use of discretization to get rid of the numerical attributes. In this paper, we illustrate why the replacement of numerical attributes by nominal attributes can result in suboptimal results. Thereafter , we present a new subgroup di… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
35
0

Year Published

2009
2009
2019
2019

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 64 publications
(35 citation statements)
references
References 12 publications
0
35
0
Order By: Relevance
“…Atzmüller and Lemmerich (2009) and Grosskreutz and Rüping (2009) that proposes a range of quality measures for numeric domains (Pieters et al 2010). Our experiments with multiple target attributes are of course based on our previous work on EMM (Leman et al 2008;Duivesteijn et al 2010;van Leeuwen 2010).…”
Section: Related Workmentioning
confidence: 99%
“…Atzmüller and Lemmerich (2009) and Grosskreutz and Rüping (2009) that proposes a range of quality measures for numeric domains (Pieters et al 2010). Our experiments with multiple target attributes are of course based on our previous work on EMM (Leman et al 2008;Duivesteijn et al 2010;van Leeuwen 2010).…”
Section: Related Workmentioning
confidence: 99%
“…In [7], the discretization happens within the algorithm and relies on a property of the function measuring subgroup quality to merge basic intervals in a bottom-up fashion. Yet, the cut points for the basic intervals are determined as a pre-processing step in a way that is not necessarily optimal with respect to their later use.…”
Section: Data Discretizationmentioning
confidence: 99%
“…Each of the initial pairs is extended in turn (lines [3][4][5][6][7][8][9][10][11][12][13][14], selecting at each step the k i most promising candidates (line 12). A value of 4 for k i , for example, enables to keep the first step candidates for both operators and both sides.…”
Section: Putting It All Togethermentioning
confidence: 99%
“…In "Subgroup discovery in numerical domains" Grosskreutz and Rüping (2009) propose an improved way of identifying subgroups for continuous-attribute data with an extensive comparison to existing approaches.…”
Section: Papers Appearing In the Journal Of Data Mining And Knowledgementioning
confidence: 99%