Instance Selection and Construction for Data Mining 2001
DOI: 10.1007/978-1-4757-3359-4_6

Genetic-Algorithm-Based Instance and Feature Selection

Abstract: This chapter discusses a genetic-algorithm-based approach for selecting a small number of instances from a given data set in a pattern classification problem. Our genetic algorithm also selects a small number of features. The selected instances and features are used as a reference set in a nearest neighbor classifier. Our goal is to improve the classification ability of our nearest neighbor classifier by searching for an appropriate reference set. We first describe the implementation of our genetic algorithm f…
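The approach the abstract describes can be sketched in code. The chapter's exact encoding, operators, and fitness weights are not given here, so the following is a minimal illustrative sketch under common assumptions: one binary chromosome concatenating instance bits and feature bits, scored by leave-one-out accuracy of a 1-NN classifier restricted to the selected reference set, with a small penalty on reference-set size. The dataset, weights, and operator choices are all placeholders, not the chapter's values.

```python
import random

random.seed(0)

# Toy dataset: 4 numeric features, two classes (values are illustrative only).
DATA = [([1.0, 0.2, 5.0, 0.1], 0), ([1.1, 0.1, 4.8, 0.2], 0),
        ([0.9, 0.3, 5.2, 0.0], 0), ([3.0, 2.2, 1.0, 0.9], 1),
        ([3.1, 2.0, 1.2, 1.0], 1), ([2.9, 2.4, 0.8, 0.8], 1)]
N_INSTANCES, N_FEATURES = len(DATA), 4

def decode(chrom):
    """Split one binary chromosome into instance bits and feature bits."""
    return chrom[:N_INSTANCES], chrom[N_INSTANCES:]

def classify(x, refs, feats):
    """1-NN over the selected reference instances, using selected features."""
    best, best_d = None, float("inf")
    for xi, yi in refs:
        d = sum((x[j] - xi[j]) ** 2 for j in feats)
        if d < best_d:
            best, best_d = yi, d
    return best

def fitness(chrom, w_acc=10.0, w_size=1.0):
    """Reward leave-one-out accuracy, penalize reference-set size.
    The weights w_acc/w_size are assumptions, not the chapter's values."""
    inst_bits, feat_bits = decode(chrom)
    feats = [j for j, b in enumerate(feat_bits) if b]
    if not feats or not any(inst_bits):
        return float("-inf")          # empty reference set is invalid
    correct = 0
    for i, (x, y) in enumerate(DATA):
        # Leave instance i out of its own reference pool.
        pool = [DATA[k] for k, b in enumerate(inst_bits) if b and k != i]
        if pool and classify(x, pool, feats) == y:
            correct += 1
    acc = correct / N_INSTANCES
    size = (sum(inst_bits) + sum(feat_bits)) / (N_INSTANCES + N_FEATURES)
    return w_acc * acc - w_size * size

def ga(pop_size=20, gens=30, p_mut=0.05):
    length = N_INSTANCES + N_FEATURES
    pop = [[random.randint(0, 1) for _ in range(length)]
           for _ in range(pop_size)]
    for _ in range(gens):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[:pop_size // 2]              # truncation selection
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = random.sample(survivors, 2)
            cut = random.randrange(1, length)        # one-point crossover
            child = [bit ^ (random.random() < p_mut)  # bit-flip mutation
                     for bit in a[:cut] + b[cut:]]
            children.append(child)
        pop = survivors + children
    return max(pop, key=fitness)

best = ga()
inst_bits, feat_bits = decode(best)
print("selected instances:", [i for i, b in enumerate(inst_bits) if b])
print("selected features: ", [j for j, b in enumerate(feat_bits) if b])
```

Because instance bits and feature bits live in one chromosome, crossover and mutation explore both reductions jointly, which is the key idea of selecting instances and features simultaneously rather than in two separate passes.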

Cited by 19 publications (13 citation statements)
References 9 publications (12 reference statements)
“…We test the suitability of the approach on different datasets and compare the performance achieved to that of existing CBM algorithms from the literature. Previous works are mainly focused on reducing either the number of redundant cases or noisy cases [1,7,8,20-22], or aimed at selecting attributes [10,12] or to both enhance the accuracy and reduce the size of the case-base [11]. However, the fitness function proposed in this work measures the redundancy of the case-base, the number of noisy cases and the error rate of the system.…”
Section: Discussion
confidence: 99%
“…An initial population of chromosomes are selected either randomly or with some heuristics, and the selection-crossover-mutation operations are applied to the chromosomes to obtain better solutions. Ishibuchi [13] attempted to apply the GA to both feature and instance selection simultaneously and did obtain some good results, but the study is limited to the problem domain where the nearest neighbor method is applied for classification analysis on numeric data. In short, it appears that existing studies on applying the GA for data reduction tend to be task-specific (classification in particular), and whether and how the GA techniques can be used for general purpose data reduction remains to be investigated.…”
Section: Adaptive Sampling Procedures
confidence: 99%
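The selection-crossover-mutation cycle this citation statement refers to can be shown on its own, detached from any particular selection task. The sketch below is a generic single-population GA on a toy "count the one-bits" objective; the operator choices (roulette-wheel selection, one-point crossover, elitism) are common defaults, not claims about any cited study.

```python
import random

random.seed(1)
LENGTH, POP, GENS, P_MUT = 16, 10, 25, 0.05

def fitness(c):
    """Toy objective: maximize the number of 1-bits in the chromosome."""
    return sum(c)

def roulette(pop):
    """Fitness-proportional (roulette-wheel) selection of one parent."""
    total = sum(fitness(c) for c in pop)
    r = random.uniform(0, total)
    acc = 0.0
    for c in pop:
        acc += fitness(c)
        if acc >= r:
            return c
    return pop[-1]

# Initial population of random chromosomes.
pop = [[random.randint(0, 1) for _ in range(LENGTH)] for _ in range(POP)]

for _ in range(GENS):
    new = []
    while len(new) < POP:
        a, b = roulette(pop), roulette(pop)          # selection
        cut = random.randrange(1, LENGTH)            # one-point crossover
        child = [bit ^ (random.random() < P_MUT)     # bit-flip mutation
                 for bit in a[:cut] + b[cut:]]
        new.append(child)
    new[0] = max(pop, key=fitness)  # elitism: keep the best-so-far
    pop = new

print("best fitness:", fitness(max(pop, key=fitness)))
```

With elitism the best fitness in the population never decreases across generations, which is why repeated application of the three operators tends "to obtain better solutions" as the statement above puts it.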
“…The application of the GAs for feature selection has been explored in [5,13,23], among others. Since feature selection is often associated with classification problems, the fitness functions specified in these studies are more or less a function of classification error rate.…”
Section: Adaptive Sampling Procedures
confidence: 99%
“…Finally, a related area of research is that of the use of GAs for feature and instance selection in, for example, data mining. A number of references to this literature can be found in [10].…”
Section: Introduction
confidence: 99%