2015 IEEE 5th International Conference on Electronics Information and Emergency Communication 2015
DOI: 10.1109/iceiec.2015.7284549
|View full text |Cite
|
Sign up to set email alerts
|

Study on GSP algorithm based on Hadoop

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 5 publications
0
5
0
Order By: Relevance
“…This algorithm was called Ha-GSP. 42 Another GSP variant was implemented on Spark by Yu et al, 43 where they introduced two different database partitioning solutions for the unbalanced loading issue. They initially loaded the database from HDFS to Spark RDDs (Resilient Distributed Datasets), and saved the interim results in the RDDs to minimize input-output overhead.…”
Section: Parallelize the Spade Algorithm With The Recursive Dynamic Loadmentioning
confidence: 99%
See 1 more Smart Citation
“…This algorithm was called Ha-GSP. 42 Another GSP variant was implemented on Spark by Yu et al, 43 where they introduced two different database partitioning solutions for the unbalanced loading issue. They initially loaded the database from HDFS to Spark RDDs (Resilient Distributed Datasets), and saved the interim results in the RDDs to minimize input-output overhead.…”
Section: Parallelize the Spade Algorithm With The Recursive Dynamic Loadmentioning
confidence: 99%
“…In Hadoop, Li et al presented an algorithm that utilizes divide and conquer capabilities provided in the MapReduce programming model. This algorithm was called Ha‐GSP 42 . Another GSP variant was implemented on Spark by Yu et al, 43 where they introduced two different database partitioning solutions for the unbalanced loading issue.…”
Section: Fundamental Concepts and Literature Reviewmentioning
confidence: 99%
“…Sequential Pattern Mining algorithms are also implemented and parallelized on Hadoop and Spark to get rid of the load and serial operation overhead. [20][21][22]48,49 Yu et al proposed a distributed GSP (DGSP) algorithm based on MapReduce on Hadoop. 20 The DGSP algorithm partitions the database and assign jobs to map workers and so optimizes the workload balance.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Li et al proposed Ha-GSP on Hadoop by using Map and Reduce functions of MapReduce. 21 Yu et al implemented the GSP algorithm on Spark. 22 For the imbalance load problem, two different database partitioning strategies were proposed.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation