2017
DOI: 10.1002/widm.1207
|View full text |Cite
|
Sign up to set email alerts
|

A survey of itemset mining

Abstract: Itemset mining is an important subfield of data mining, which consists of discovering interesting and useful patterns in transaction databases. The traditional task of frequent itemset mining is to discover groups of items (itemsets) that appear frequently together in transactions made by customers. Although itemset mining was designed for market basket analysis, it can be viewed more generally as the task of discovering groups of attribute values frequently cooccurring in databases. Because of its numerous ap… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
137
0
5

Year Published

2017
2017
2022
2022

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 193 publications
(156 citation statements)
references
References 111 publications
1
137
0
5
Order By: Relevance
“…The aim of this article is to show the improvements addressed during the last 25 years, that is, since the FIM task was first described (Agrawal et al, ). While some reviews have been already proposed in literature (Chee, Jaafar, Aziz, Hasan, & Yeoh, ; Fournier‐Viger et al, ), they are mainly focused on sequential exhaustive search approaches and on describing the algorithms for nonexpert users. In this sense, it is our understanding that an analysis from an expert point of view that involves any existing methodology (exhaustive and nonexhaustive search) on any architecture (centralized and parallel) is necessary to comprehend which improvements have been proposed over time.…”
Section: Lesson Learnedmentioning
confidence: 99%
See 1 more Smart Citation
“…The aim of this article is to show the improvements addressed during the last 25 years, that is, since the FIM task was first described (Agrawal et al, ). While some reviews have been already proposed in literature (Chee, Jaafar, Aziz, Hasan, & Yeoh, ; Fournier‐Viger et al, ), they are mainly focused on sequential exhaustive search approaches and on describing the algorithms for nonexpert users. In this sense, it is our understanding that an analysis from an expert point of view that involves any existing methodology (exhaustive and nonexhaustive search) on any architecture (centralized and parallel) is necessary to comprehend which improvements have been proposed over time.…”
Section: Lesson Learnedmentioning
confidence: 99%
“…Since 1993, when the first FIM algorithm was released (Agrawal et al, 1993), a special attention has been given to the performance of novel algorithms in this field (Borgelt, 2012;Fournier-Viger, Lin, Vo, Truong et al, 2017). Nowadays, 25 years later, extremely large datasets can be analyzed in a few seconds and this is not only a matter of novel architectures and hardware progresses but also a consequence of the proposed algorithmic solutions.…”
Section: Introductionmentioning
confidence: 99%
“…3. It follows breadth-first search approach which is quite costly in terms of memory utilization [41].…”
Section: Utility Of Itemset (U) = Internal Utility (I) * External Utimentioning
confidence: 99%
“…In many real‐world applications, data mining techniques are used to extract interesting patterns from databases, to support crucial decision‐making. Two fundamental tasks for revealing interesting relationships between items in transactional databases are frequent itemset mining (FIM) and association rule mining (ARM) (Agrawal, Imielinski, & Swami, ; Chen, Han, & Yu, ; Fournier‐Viger et al, ). The most well‐known ARM algorithms are Apriori (Agrawal & Srikant, ) and FP‐Growth (Han, Pei, Yin, & Mao, ).…”
Section: Introductionmentioning
confidence: 99%
“…To the best of our knowledge, this is the first survey on the mining task of incremental high‐utility itemset mining. The methods discussed in this article are not only important for iHUIM (Ahmed et al, ; Fournier‐Viger et al, ; Lin et al, ), but can also serve as inspiration for other data mining tasks (Fournier‐Viger et al, ), including incremental data mining (Hong et al, ) and dynamic data mining (Lin et al, ). The major contributions of this paper are threefold. A taxonomy of the most common approaches for mining HUIs in static databases, including Apriori‐based, tree‐based, projection‐based, hybrid, and other approaches, is presented.…”
Section: Introductionmentioning
confidence: 99%