Paul Leng scite author profile

Classification Association Rule Mining (CARM) systems operate by applying an Association Rule Mining (ARM) method to obtain classification rules from a training set of previously classified data. The rules thus generated will be influenced by the choice of ARM parameters employed by the algorithm (typically support and confidence threshold values). In this paper we examine the effect that this choice has on the predictive accuracy of CARM methods. We show that the accuracy can almost always be improved by a suitable choice of parameters, and describe a hill-climbing method for finding the best parameter settings. We also demonstrate that the proposed hill-climbing method is most effective when coupled with a fast CARM algorithm such as the TFPC algorithm, which is also described.
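The abstract above does not spell out the hill-climbing procedure, so the following is only a generic sketch of the idea: greedily move the (support, confidence) threshold pair toward higher accuracy, halving the step size when no neighbour improves. The function names, step sizes and the synthetic `toy_accuracy` surface are all assumptions made for illustration. In practice the accuracy function would train a CARM classifier at the given thresholds and score it on held-out data.

```python
from typing import Callable, Tuple

Point = Tuple[float, float]  # (support threshold %, confidence threshold %)


def hill_climb(accuracy: Callable[[float, float], float],
               start: Point,
               step: Point = (4.0, 16.0),
               min_step: Point = (0.25, 1.0)) -> Tuple[Point, float]:
    """Greedy threshold search; step sizes are halved whenever no neighbour improves."""
    best, best_acc = start, accuracy(*start)
    ds, dc = step
    while ds >= min_step[0] or dc >= min_step[1]:
        s, c = best
        neighbours = [(s + ds, c), (s - ds, c), (s, c + dc), (s, c - dc)]
        # Keep both thresholds inside a valid percentage range.
        neighbours = [(a, b) for a, b in neighbours if 0 < a <= 100 and 0 <= b <= 100]
        cand = max(neighbours, key=lambda p: accuracy(*p))
        cand_acc = accuracy(*cand)
        if cand_acc > best_acc:
            best, best_acc = cand, cand_acc   # move uphill
        else:
            ds, dc = ds / 2, dc / 2           # stuck: refine the step


    return best, best_acc


def toy_accuracy(support: float, confidence: float) -> float:
    """Hypothetical smooth accuracy surface with a single peak at (2.5, 60)."""
    return 90.0 - (support - 2.5) ** 2 - ((confidence - 60.0) / 10.0) ** 2


best, best_acc = hill_climb(toy_accuracy, start=(1.0, 50.0))
print(best, best_acc)
```

Because every accuracy evaluation stands for a full mine-and-classify run, the cost of a search like this is dominated by the speed of the underlying CARM algorithm, which is consistent with the abstract's remark about pairing it with a fast algorithm.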


Threshold Tuning for Improved Classification Association Rule Mining

Coenen

Leng

Zhang

2005


Abstract. One application of Association Rule Mining (ARM) is to identify Classification Association Rules (CARs) that can be used to classify future instances from the same population as the data being mined. Most CARM methods first mine the data for candidate rules, then prune these using coverage analysis of the training data. In this paper we describe a CARM algorithm that avoids the need for coverage analysis, and a technique for tuning its threshold parameters to obtain more accurate classification. We present results to show this approach can achieve better accuracy than comparable alternatives at lower cost.


An Evaluation of Approaches to Classification Rule Selection

Coenen

Leng


Algorithms for Computing Association Rules Using a Partial-Support Tree

Goulbourne

Coenen

Leng

2000


Computing Association Rules Using Partial Totals

Coenen

Goulbourne

Leng

2001


The problem of extracting all association rules from within a binary database is well-known. Existing methods may involve multiple passes over the database, and cope badly with densely-packed database records because of the combinatorial explosion in the number of sets of attributes for which incidence counts must be computed. We describe here a class of methods we have introduced that begin by using a single database pass to perform a partial computation of the totals required, storing these in the form of a set enumeration tree, which is created in time linear in the size of the database. Algorithms for using this structure to complete the count summations are discussed, and one such method, derived from the well-known Apriori algorithm, is described. Results are presented demonstrating the performance advantage to be gained from the use of this approach.
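The two-stage idea in the abstract (one pass that records partial totals, followed by a summation step that completes them) can be illustrated with a deliberately simplified sketch. It is not the authors' algorithm: a flat dictionary stands in for the set enumeration tree, and the summation is a brute-force scan rather than a tree traversal. The function names and the toy records are assumptions.

```python
from collections import Counter
from typing import Dict, FrozenSet, Iterable, Set


def partial_counts(records: Iterable[Set[str]]) -> Dict[FrozenSet[str], int]:
    """Single database pass: count how often each distinct attribute set occurs."""
    return Counter(frozenset(r) for r in records)


def total_support(itemset: Iterable[str], partial: Dict[FrozenSet[str], int]) -> int:
    """Complete the summation: add the partial counts of every stored superset."""
    target = frozenset(itemset)
    return sum(n for stored, n in partial.items() if target <= stored)


# Toy binary database: each record is the set of attributes present in it.
records = [{"a", "b", "c"}, {"a", "b"}, {"a", "b", "c"}, {"b", "c"}, {"a"}]
partial = partial_counts(records)

print(total_support({"a", "b"}, partial))
```

The partial count of a set covers only records exactly equal to it; the total support adds in every record that strictly contains it, which is the work deferred to the second stage.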


Obtaining Best Parameter Values for Accurate Classification

Coenen

Leng



Paul Leng

Tree Structures for Mining Association Rules

Data structure for association rule mining: T-trees and P-trees

The effect of threshold values on association rule based classification accuracy

