Ingolf Geist scite author profile

Scalable data mining in large databases is one of today's real challenges for the database research area. Integrating data mining with database systems is an essential component of any successful large-scale data mining application. A fundamental component of data mining tasks is finding frequent patterns in a given dataset. Most previous studies adopt an Apriori-like candidate set generation-and-test approach. However, candidate set generation is still costly, especially when there are prolific patterns and/or long patterns, because a huge number of candidate sets must be handled. In this study we present an evaluation of SQL-based frequent pattern mining with a novel frequent pattern growth (FP-growth) method, which is efficient and scalable for mining both long and short patterns without candidate generation. We examine some techniques to improve performance. In addition, we have evaluated performance on a DBMS, IBM DB2 UDB EEE V8.1.
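As a rough illustration of what counting pattern support inside a database can look like, the sketch below computes frequent single items with one SQL aggregation, which is the kind of first pass an FP-growth-style miner typically starts from. It is not the method evaluated in the abstract: the table name `trans`, the function `frequent_items`, and the use of SQLite instead of DB2 are assumptions made only for this example.

```python
import sqlite3


def frequent_items(transactions, min_support):
    """Return [(item, support)] for items found in at least min_support transactions.

    Hypothetical helper for illustration; the schema (tid, item) is an assumption.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE trans (tid INTEGER, item TEXT)")
    # One row per (transaction, item); set() drops duplicates within a transaction.
    conn.executemany(
        "INSERT INTO trans VALUES (?, ?)",
        [(tid, item) for tid, items in enumerate(transactions) for item in set(items)],
    )
    # A single GROUP BY counts support; HAVING filters out infrequent items.
    rows = conn.execute(
        "SELECT item, COUNT(*) AS support FROM trans "
        "GROUP BY item HAVING COUNT(*) >= ? "
        "ORDER BY support DESC, item ASC",
        (min_support,),
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    data = [["a", "b", "c"], ["a", "b"], ["a", "d"], ["b", "c"]]
    print(frequent_items(data, 2))
```

Ordering items by descending support mirrors the item order commonly used when building an FP-tree; the tree construction and pattern growth steps themselves are left out of this sketch.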


A Database-Supported Workbench for Information Fusion: InFuse

Dunemann, Geist, Jesse, et al. 2002




Ingolf Geist

SQL based frequent pattern mining without candidate generation

Towards Optimization of Hybrid CPU/GPU Query Plans in Database Systems

Autonomous query-driven index tuning

Concept-based querying in mediator systems

A framework for data mining and KDD

Supporting Similarity Operations Based on Approximate String Matching on the Web

SQL Based Frequent Pattern Mining with FP-Growth

A Database-Supported Workbench for Information Fusion: InFuse
