2015
DOI: 10.1007/s12293-015-0173-y
|View full text |Cite
|
Sign up to set email alerts
|

Genetic programming for feature construction and selection in classification on high-dimensional data

Abstract: Classification on high-dimensional data with thousands to tens of thousands of dimensions is a challenging task due to the high dimensionality and the quality of the feature set. The problem can be addressed by using feature selection to choose only informative features or feature construction to create new high-level features. Genetic programming (GP) using a tree-based representation can be used for both feature construction and implicit feature selection. This work presents a comprehensive study to investig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
52
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 153 publications
(57 citation statements)
references
References 33 publications
(39 reference statements)
0
52
0
Order By: Relevance
“…The use of GP for visualisation of solutions for production scheduling problems has also been recently investigated [13]. GP has also been applied to other tangential unsupervised learning tasks for feature creation, such as clustering [7], as well as extensive use in supervised learning domains [20,12]. Clearly, GP has shown significant potential as a feature construction method, and so it is hoped that it can be extended to directly perform manifold learning as well.…”
Section: Related Workmentioning
confidence: 99%
“…The use of GP for visualisation of solutions for production scheduling problems has also been recently investigated [13]. GP has also been applied to other tangential unsupervised learning tasks for feature creation, such as clustering [7], as well as extensive use in supervised learning domains [20,12]. Clearly, GP has shown significant potential as a feature construction method, and so it is hoped that it can be extended to directly perform manifold learning as well.…”
Section: Related Workmentioning
confidence: 99%
“…Feature construction for high-dimensional datasets is considered in [16], for eight bio-medical binary classification problems, with 2,000 to 24,188 features. This approach is different from the typical ones, as the authors propose to use SGP to evolve classifiers rather than features, and extract features from the components (subtrees) of such classifiers.…”
Section: Related Workmentioning
confidence: 99%
“…FS and FC have seen wide application in supervised learning tasks [41,39], including EC-based FS [43] and FC [32,42]. FS and FC have also been used in unsupervised learning tasks [12], but little research has considered EC-based approaches for these tasks [2].…”
Section: Dimensionality Reductionmentioning
confidence: 99%