2020
DOI: 10.1007/978-3-030-37051-0_85
|View full text |Cite
|
Sign up to set email alerts
|

RDD-Eclat: Approaches to Parallelize Eclat Algorithm on Spark RDD Framework

Abstract: Initially, a number of frequent itemset mining (FIM) algorithms have been designed on the Hadoop MapReduce, a distributed big data processing framework. But, due to heavy disk I/O, MapReduce is found to be inefficient for such highly iterative algorithms. Therefore, Spark, a more efficient distributed data processing framework, has been developed with in-memory computation and resilient distributed dataset (RDD) features to support the iterative algorithms. On the Spark RDD framework, Apriori and FP-Growth bas… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 27 publications
(62 reference statements)
0
6
0
Order By: Relevance
“…RDD-Eclat [36] is a parallel Eclat algorithm entitled RDD-Eclat and the implementation of its five variations on the Spark RDD framework. EclatV1 is the first version, while the others are EclatV2, EclatV3, EclatV4, and EclatV5.…”
Section: Vertical Layout-based Algorithmsmentioning
confidence: 99%
“…RDD-Eclat [36] is a parallel Eclat algorithm entitled RDD-Eclat and the implementation of its five variations on the Spark RDD framework. EclatV1 is the first version, while the others are EclatV2, EclatV3, EclatV4, and EclatV5.…”
Section: Vertical Layout-based Algorithmsmentioning
confidence: 99%
“…The rule search space is effectively divided into subspace sets through concept lattice and equivalence relationships. The support calculation of each itemset does not require repeated retrieval of the entire dataset [16][17][18][19]. The main idea of using Eclat framework to study learning behaviors need the support of big data set of learning behaviors, through data transposition and standardization processing, we can get the itemsets and the transaction set.…”
Section: Frequent Itemsets Mining Based On Eclat Frameworkmentioning
confidence: 99%
“…horizontal database record or breadth first searching [6] and vertical database record [7][8] or depth first searching. When the horizontal record drawback issues are subjected to storage and memory, thus contemporary works are then utilized on the vertical database for rules mining algorithms that are proposed in [8][9][10]. In ARM, the so-called state-of-the-art frequent/infrequent models are Apriori [1,6] underlying on horizontal records.…”
Section: Related Workmentioning
confidence: 99%
“…To the best of our knowledge, Equivalent Class Transformation (Eclat) algorithm [8] outperforms because of its 'fast' intersection of its transaction-id-list to determine the minimum or maximum support threshold [9,14]. The Eclat followers and the invariants are [9][10][11][12][13], [15][16][17][18][19][20], [22] and [26].…”
Section: Related Workmentioning
confidence: 99%