2002
DOI: 10.1016/s1568-4946(02)00031-5
|View full text |Cite
|
Sign up to set email alerts
|

A genetic-algorithm for discovering small-disjunct rules in data mining

Abstract: This paper addresses the well-known classification task of data mining, where the goal is to discover rules predicting the class of examples (records of a given data set). In the context of data mining, small disjuncts are rules covering a small number of examples. Hence, these rules are usually error-prone, which contributes to a decrease in predictive accuracy. At first glance, this is not a serious problem, since the impact on predictive accuracy should be small. However, although each small disjunct covers… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
34
0
2

Year Published

2004
2004
2015
2015

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 55 publications
(36 citation statements)
references
References 13 publications
0
34
0
2
Order By: Relevance
“…This is a hybrid method that combines decision trees and genetic algorithms [5], [6], [3], [4]. The basic idea is to use a decision-tree algorithm to classify examples belonging to large disjuncts and use a genetic algorithm to discover rules classifying examples belonging to small disjuncts.…”
Section: A Hybrid Decision-tree / Genetic-algorithm Methods For Discovmentioning
confidence: 99%
“…This is a hybrid method that combines decision trees and genetic algorithms [5], [6], [3], [4]. The basic idea is to use a decision-tree algorithm to classify examples belonging to large disjuncts and use a genetic algorithm to discover rules classifying examples belonging to small disjuncts.…”
Section: A Hybrid Decision-tree / Genetic-algorithm Methods For Discovmentioning
confidence: 99%
“…• Carvalho and Freitas present in [8,9] a hybrid decision tree/genetic algorithm approach for a predictive rule induction process, which follows the following model: examples belonging to large disjuncts are classified by rules produced by a decision-tree algorithm while examples belonging to small disjuncts are classified by rules produced by a GA specifically designed for this task. In the GA each individual represents a small-disjunct rule identified by a decision tree leaf node.…”
Section: Rule Induction Using Genetic Algorithmsmentioning
confidence: 99%
“…The GA uses a new operator specially designed to improve the comprehensibility of the rules. The authors propose in [10] some modifications to the original GA. The most significant change is the use of a sequential niching method [7] to foster population diversity and avoid the GA convergence to a single rule.…”
Section: Rule Induction Using Genetic Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…The large disjunct are the rule sets which covers a large number of training instances and small disjuncts are the rules sets which covers small number of the training instances. The rule induction methods ignores small disjunct and discovers only large disjunct therefore the classification accuracy will significantly degraded in many circumstances [8,9,10].…”
Section: Introductionmentioning
confidence: 99%