2019
DOI: 10.1186/s40537-019-0238-8
|View full text |Cite
|
Sign up to set email alerts
|

Positive and negative association rule mining in Hadoop’s MapReduce environment

Abstract: Association rule mining, originally developed by [3], is a well-known data mining technique used to find associations between items or itemsets. In today's big data environment, association rule mining has to be extended to big data. The Apriori algorithm is one of the most commonly used algorithms for association rule mining [4]. Using the Apriori algorithm, we find frequent patterns, that is, patterns that occur frequently in data. The Apriori algorithm employs an iterative approach where k-itemsets are used… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 24 publications
(16 citation statements)
references
References 26 publications
(25 reference statements)
0
14
0
Order By: Relevance
“…Because the data size and complexity increase rapidly, an efficient method is required for mining such rules. Most of the past studies focused on positive association rule mining; therefore, S. Bagui [26] proposed a method to efficiently discover both positive and negative association rules, simultaneously, from big data, using Hadoop's MapReduce architecture.…”
Section: B Distributed Algorithms For Discovery Of Frequent Patternsmentioning
confidence: 99%
“…Because the data size and complexity increase rapidly, an efficient method is required for mining such rules. Most of the past studies focused on positive association rule mining; therefore, S. Bagui [26] proposed a method to efficiently discover both positive and negative association rules, simultaneously, from big data, using Hadoop's MapReduce architecture.…”
Section: B Distributed Algorithms For Discovery Of Frequent Patternsmentioning
confidence: 99%
“…In this study, we evaluated whether co-injections were associated with low SORs and if production regions were associated with high solution gas oil ratios (SGORs). The association rule had three parameters: support, confidence, and lift [46,47]. In this context, the association rule can be written as:…”
Section: Association Rulementioning
confidence: 99%
“…The data integration capability of Hadoop enables it to integrate data from different sources, in different formats and characters into data lakes that support data input and output between multiple data sources and databases [21]. Bagui and Dhar [22] created a data warehouse using the Hadoop MapReduce framework on the Amazon AWS EMR. The data warehouse had 1.5 GB real life transactional dataset from 1.7 million web html documents mainly written in English and sourced from several websites.…”
Section: Big Data Modelsmentioning
confidence: 99%