2015
DOI: 10.1007/s10618-015-0403-4
|View full text |Cite
|
Sign up to set email alerts
|

Exceptional Model Mining

Abstract: Abstract. In most databases, it is possible to identify small partitions of the data where the observed distribution is notably different from that of the database as a whole. In classical subgroup discovery, one considers the distribution of a single nominal attribute, and exceptional subgroups show a surprising increase in the occurrence of one of its values. In this paper, we introduce Exceptional Model Mining (EMM), a framework that allows for more complicated target concepts. Rather than finding subgroups… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
63
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 94 publications
(64 citation statements)
references
References 55 publications
0
63
0
Order By: Relevance
“…. , t are available, and if we are not interested in finding unusual target distributions, but unusual target interactions, we can employ Exceptional Model Mining (EMM) (Duivesteijn 2013;Duivesteijn et al 2016) instead of SD. EMM is instantiated by selecting two things: a model class and a quality measure.…”
Section: Definition 3 (Quality Measure)mentioning
confidence: 99%
See 1 more Smart Citation
“…. , t are available, and if we are not interested in finding unusual target distributions, but unusual target interactions, we can employ Exceptional Model Mining (EMM) (Duivesteijn 2013;Duivesteijn et al 2016) instead of SD. EMM is instantiated by selecting two things: a model class and a quality measure.…”
Section: Definition 3 (Quality Measure)mentioning
confidence: 99%
“…Typically, subgroups are found by a level-wise search through attribute space (Duivesteijn 2013). However, we consider the exact search strategy to be a parameter of the algorithm.…”
Section: Search Strategymentioning
confidence: 99%
“…The algorithms were implemented in Python. The experiments were carried on an Intel Core i7-6700HQ 2.60 GHz machine with 16 GB RAM and were run by PyPy 5.4.1 For reproducibility purpose, the source code and the data are made available in our companion page 7 . These experiments aim to answer the following questions: Q1 -Is the closure over an HMT attribute more effective than mining closed itemsets?…”
Section: Empirical Studymentioning
confidence: 99%
“…The more different is the model, the more exceptional is the subgroup. Many models have been investigated in the last decade [21,8,7,13,6]. However, no model in the EMM framework makes it possible to characterize collection of individuals whose pairwise agreement exceptionally deviates according to a subset of objects.…”
Section: Introductionmentioning
confidence: 99%
“…In [4,11] different model classes. For example, if linear regression models are trained on the whole data set and different candidate subgroups, the quality of subgroups can be evaluated by comparing the regression coefficient between the global model and the local subgroup model.…”
Section: Related Workmentioning
confidence: 99%