2003
DOI: 10.1007/3-540-36599-0_36
|View full text |Cite
|
Sign up to set email alerts
|

Genetic Programming for Attribute Construction in Data Mining

Abstract: Abstract. For a given data set, its set of attributes defines its data space representation. The quality of a data space representation is one of the most important factors influencing the performance of a data mining algorithm. The attributes defining the data space can be inadequate, making it difficult to discover highquality knowledge. In order to solve this problem, this paper proposes a Genetic Programming algorithm developed for attribute construction. This algorithm constructs new attributes out of the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
53
0

Year Published

2006
2006
2021
2021

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 73 publications
(55 citation statements)
references
References 7 publications
0
53
0
Order By: Relevance
“…It is well known that the classification performance is dependent of the quality of the data space representation (i.e., attributes of the data) [19], [20]. If the attributes defining the data space-the set of indicators in the case of EDDIE-are inadequate, it becomes more difficult to create GDTs with a high predictive quality.…”
Section: Incorporating Attribute Construction In Eddiementioning
confidence: 99%
See 1 more Smart Citation
“…It is well known that the classification performance is dependent of the quality of the data space representation (i.e., attributes of the data) [19], [20]. If the attributes defining the data space-the set of indicators in the case of EDDIE-are inadequate, it becomes more difficult to create GDTs with a high predictive quality.…”
Section: Incorporating Attribute Construction In Eddiementioning
confidence: 99%
“…Hu [20] proposed a GP for attribute construction called GPCI following the filter strategy. The terminal set of the GP consists of the booleanized original attributes and the function set consists of the AND and OR operators; each individual represent a candidate new attribute, created by combining the (now boolean) attributes using the AND and OR operators.…”
Section: Incorporating Attribute Construction In Eddiementioning
confidence: 99%
“…Examples of this approach are the GP algorithms for attribute construction proposed by (Otero et al 2003;Hu 1998), whose attribute evaluation function (the fitness function) is the information gain ratio -a measure discussed in detail in (Quinlan 1993). In addition, (Muharram & Smith 2004) did experiments comparing the effectiveness of two different attributeevaluation criteria in GP for attribute construction -viz.…”
Section: Data Preprocessing Vs Interleaving Approachmentioning
confidence: 99%
“…In [17], the authors use typed GP for building feature extractors. Functions are arithmetic and relational operators.…”
Section: Related Workmentioning
confidence: 99%