2019
DOI: 10.1631/fitee.1800467
|View full text |Cite
|
Sign up to set email alerts
|

A non-group parallel frequent pattern mining algorithm based on conditional patterns

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 38 publications
0
5
0
Order By: Relevance
“…Figures 5 present the runtime of DT-DPM and both baseline MapReduce based models using big databases for solving both Itemset and Sequential Pattern Mining problems. The Table 2 Speedup of pattern mining algorithms with and without using DT-DPM framework using different mappers (2,4,8,16,32) Problem Database Without DT-DPM With DT-DPM Mappers 2 4 8 1 6 3 2 2 4 8 1 6 3 2 p u m s b 2 3 7 1 1 3 4 5 9 1 5 1 9 3 5 m u s h r o o m 3 8 1 0 1 3 3 6 6 1 1 1 9 2 2 3 baseline methods used is FiDoop-DP [15], and NG-PFP: NonGroup Parallel Frequent Pattern mining [67] for itemset mining, and PrefixSpan-S [66] for sequence mining. The results reveal that our model outperforms the baseline MapReduce based models in terms of computational time for both itemset and sequence mining.…”
Section: Results On Big Databasesmentioning
confidence: 99%
See 1 more Smart Citation
“…Figures 5 present the runtime of DT-DPM and both baseline MapReduce based models using big databases for solving both Itemset and Sequential Pattern Mining problems. The Table 2 Speedup of pattern mining algorithms with and without using DT-DPM framework using different mappers (2,4,8,16,32) Problem Database Without DT-DPM With DT-DPM Mappers 2 4 8 1 6 3 2 2 4 8 1 6 3 2 p u m s b 2 3 7 1 1 3 4 5 9 1 5 1 9 3 5 m u s h r o o m 3 8 1 0 1 3 3 6 6 1 1 1 9 2 2 3 baseline methods used is FiDoop-DP [15], and NG-PFP: NonGroup Parallel Frequent Pattern mining [67] for itemset mining, and PrefixSpan-S [66] for sequence mining. The results reveal that our model outperforms the baseline MapReduce based models in terms of computational time for both itemset and sequence mining.…”
Section: Results On Big Databasesmentioning
confidence: 99%
“…However, it is very sensitive to the data distribution. Kuang et al [67] proposed the parallel implementation of FP-Growth algorithm in Hadoop by removing the data redundancy between the different data partitions, which allows to handle the transactions in a single pass. Sumalatha et al [68] introduces the concept of distributed temporal high utility sequential patterns, and propose an intelligent strategy by creating a time interval utility data structure for evaluating the candidate patterns.…”
Section: Related Workmentioning
confidence: 99%
“…However, this involved multiple splits of transactions to be generated increasing the data to be transferred during shuffling. In this regard, NG‐PFP 30 eliminated the formation of G$$ G $$list$$ list $$ from F$$ F $$list$$ list $$ as a means of reducing the shuffling cost to improve the efficiency.…”
Section: Related Workmentioning
confidence: 99%
“…An incremental utility-based pattern mining algorithm was put forth here. To extract data in big datasets a Binary based Technique was designed [4]. Threads were collaborated to generate frequent itemsets in a big data environment.…”
Section: Introductionmentioning
confidence: 99%