2004
DOI: 10.1016/s0020-0255(03)00414-6
A hybrid decision tree/genetic algorithm method for data mining

Abstract: This paper addresses the well-known classification task of data mining, where the objective is to predict the class to which an example belongs. Discovered knowledge is expressed in the form of high-level, easy-to-interpret classification rules. In order to discover classification rules, we propose a hybrid decision tree/genetic algorithm method. The central idea of this hybrid method involves the concept of small disjuncts in data mining, as follows. In essence, a set of classification rules can be regarded a…
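The small-disjuncts idea in the abstract can be sketched as follows. This is a hypothetical, minimal illustration, not the authors' actual algorithm: the routing function standing in for a trained decision tree, the coverage threshold, the rule form `x[1] >= t -> class 1`, and all GA parameters are assumptions. The sketch trains nothing; it groups examples by the leaf they reach, flags leaves covering few examples as small disjuncts, and runs a toy genetic algorithm over the examples those leaves cover.

```python
import random

# Hypothetical pre-built decision tree, mocked as a routing function
# that sends each example to a leaf id.
def route(x):
    # assumed tree: split on x[0], then on x[1]
    if x[0] < 5:
        return "L1" if x[1] < 3 else "L2"
    return "L3"

def find_small_disjuncts(data, min_coverage=3):
    """Group examples by the leaf they reach; leaves covering fewer
    than min_coverage examples are small disjuncts."""
    leaves = {}
    for x, y in data:
        leaves.setdefault(route(x), []).append((x, y))
    return {leaf: ex for leaf, ex in leaves.items() if len(ex) < min_coverage}

def ga_rule(examples, generations=30, pop_size=20, seed=0):
    """Toy GA evolving a threshold rule 'x[1] >= t -> class 1';
    fitness is the rule's accuracy on the small-disjunct examples."""
    rng = random.Random(seed)
    def fitness(t):
        return sum((1 if x[1] >= t else 0) == y for x, y in examples)
    population = [rng.uniform(0, 10) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]                # truncation selection
        children = [p + rng.gauss(0, 0.5) for p in parents]  # mutation
        population = parents + children
    return max(population, key=fitness)

data = [((1, 1), 0), ((2, 2), 0), ((3, 1), 0), ((4, 2), 0),  # leaf L1 (large)
        ((1, 4), 1), ((2, 5), 1),                            # leaf L2 (small)
        ((6, 1), 0), ((7, 2), 0), ((8, 3), 0), ((9, 4), 1)]  # leaf L3 (large)
small = find_small_disjuncts(data)
t = ga_rule(small["L2"])
print(sorted(small))  # ['L2']
print(t <= 4)         # True: rule 'x[1] >= t' fits both L2 examples
```

The point of the split is that the tree handles large disjuncts well, while the GA searches for rules on the sparse examples where greedy tree induction tends to overfit.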

Cited by 27 publications (30 citation statements); references 7 publications (11 reference statements).
“…[79] Table 6 concerns results from [79], where 10 different methods of automatic classifier construction from data were considered. We chose 7 of them for comparative analysis: C45R (the C4.5Rules technique of generating a set of production rules from a tree produced by the C4.5 algorithm) [83], RIPPER (an improvement of the efficient Incremental Reduced Error Pruning algorithm) [84], MPLCS (a Memetic Pittsburgh Learning Classifier System) [85], AntMiner+ (an Ant Colony-based data miner that extracts classification rules, inspired by research on the behaviour of real ant colonies) [86], CORE (COevolutionary Rule Extractor) [87], DTGA (a hybrid decision tree/genetic algorithm discovering rules on small disjuncts) [88], and GFS-GP (a Genetic Programming algorithm used to learn fuzzy rule-based classifiers) [89]. The three remaining methods of [79] were skipped: ILGA (an Incremental Learning approach to Genetic Algorithms) [90], SLAVE (Structural Learning Algorithm on Vague Environment) [91], and ICRM (the Interpretable Classification Rule Mining algorithm), proposed by the authors of [79].…”
Section: A Comparative Analysis With Alternative Techniques
Confidence: 99%
“…There are many studies attempting to improve classification efficiency. Chang and Liu [11] and Carvalho and Freitas [8] combined decision trees with other methods such as K-means and fuzzy C-means. Chandra and Varghese [10] proposed a binary decision tree algorithm using the Gini index as the split measure.…”
Section: Fuzzy Decision Tree (FDT)
Confidence: 99%
“…Total cost (TC) is obtained by defining and discussing the links between R&D cost, market capital, and design quality for market share in Eq. (8). Market share can be gained by attracting customers whose preferences are more distant from the target market.…”
Section: The Formulation Of Product Mix-experience Problems Consideri…
Confidence: 99%
“…The most common strategy is to evaluate every possible subtree, working from the leaves backward, for possible replacement by a terminal node. Nontrivial heuristics, including genetic algorithms [114], have also been demonstrated as useful pruning strategies [115,116].…”
Section: Decision Trees and Random Forests
Confidence: 99%
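The leaves-backward pruning strategy described in that excerpt can be sketched as follows. This is a minimal, assumed implementation, not code from [114–116]: the dict-based tree representation and the validation set are hypothetical, and subtree replacement is decided by comparing validation hits of the subtree against a majority-class terminal node.

```python
# Bottom-up (reduced-error-style) pruning sketch: internal nodes are
# dicts {"feat", "thr", "left", "right"}, leaves are class labels.
# Subtrees are considered leaves-first for replacement by a
# majority-class terminal node.
def predict(node, x):
    while isinstance(node, dict):
        node = node["left"] if x[node["feat"]] < node["thr"] else node["right"]
    return node

def majority(labels):
    return max(set(labels), key=labels.count)

def prune(node, val):
    """Prune 'node' using the validation examples 'val' that reach it;
    returns either the (possibly pruned) dict or a leaf label."""
    if not isinstance(node, dict) or not val:
        return node
    left = [(x, y) for x, y in val if x[node["feat"]] < node["thr"]]
    right = [(x, y) for x, y in val if x[node["feat"]] >= node["thr"]]
    node["left"] = prune(node["left"], left)    # prune children first,
    node["right"] = prune(node["right"], right)  # i.e. work leaves backward
    labels = [y for _, y in val]
    leaf = majority(labels)
    leaf_hits = sum(y == leaf for y in labels)
    subtree_hits = sum(predict(node, x) == y for x, y in val)
    # replace the subtree when a terminal node does at least as well
    return leaf if leaf_hits >= subtree_hits else node

tree = {"feat": 0, "thr": 5, "left": 0,
        "right": {"feat": 1, "thr": 3, "left": 1, "right": 0}}
# On this validation set the right subtree is noisy: collapsing it to
# the leaf '1' matches its examples at least as well as keeping it.
val = [((2, 0), 0), ((3, 0), 0), ((6, 1), 1), ((6, 4), 1)]
pruned = prune(tree, val)
print(pruned["right"])  # 1 — subtree replaced by a terminal node
```

A genetic algorithm replaces the greedy "accept any non-worsening collapse" rule here with a population-based search over which subtrees to collapse, which is the substitution [114] explores.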