2017
DOI: 10.1109/tpds.2016.2560176
|View full text |Cite
|
Sign up to set email alerts
|

FiDoop-DP: Data Partitioning in Frequent Itemset Mining on Hadoop Clusters

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
30
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 63 publications
(30 citation statements)
references
References 29 publications
0
30
0
Order By: Relevance
“…MapReduce is a software framework that is designed for cluster computing . The architecture of cluster systems is a multiprocessing system that connects multiple hosts and has high computing power and reliability to meet the needs of various types of applications . MapReduce and the Hadoop distributed file system (HDFS) are two cores of this framework to achieve decentralized computing .…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…MapReduce is a software framework that is designed for cluster computing . The architecture of cluster systems is a multiprocessing system that connects multiple hosts and has high computing power and reliability to meet the needs of various types of applications . MapReduce and the Hadoop distributed file system (HDFS) are two cores of this framework to achieve decentralized computing .…”
Section: Related Workmentioning
confidence: 99%
“…Figure illustrates the HDFS architecture, in which the coordinator NameNode schedules the processes with the metadata and the processes are assigned to a cluster of DataNodes . All the data are split into several blocks and stored in different DataNodes, and each block in other nodes has several replications . When a program requires access to a file, NameNode coordinates the relevant DataNode to respond and NameNode moves the files stored in the HDFS and simultaneously copies them to the other DataNodes.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…FIUT is another technique for mining frequent itemsets. It is exceptionally productive system for frequent itemset mining(FIM) called as Frequent Itemset Ultrametric Tree (FIUT) [4] [11]. It has two main phases of scans of database.…”
Section: Related Workmentioning
confidence: 99%
“…Unfortunately, pattern mining techniques for large databases, such as FIM, suffer from long processing time (runtime). To reduce the runtime of pattern mining, several optimization techniques have been proposed [2,3]. However, these optimization techniques are incapable of dealing with databases containing a huge number of items, where only few of the relevant patterns are displayed to the end user.…”
Section: Introductionmentioning
confidence: 99%