2017
DOI: 10.1007/s13748-017-0117-5
|View full text |Cite
|
Sign up to set email alerts
|

MR-DIS: democratic instance selection for big data by MapReduce

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(10 citation statements)
references
References 22 publications
0
10
0
Order By: Relevance
“…In particular, we can find approaches based on k‐NN for Big Data such as Peralta et al () where FS is performed on huge datasets using the k‐NN algorithm within an evolutionary approach, or a distributed Spark‐based version of the ReliefF algorithm Palma‐Mendoza, Rodriguez, and de‐Marcos (). In Arnaiz‐González, González‐Rogel, Díez‐Pastor, and López‐Nozal () a parallel implementation of the Democratic IS algorithm (DIS) is presented, called MR‐DIS. The idea of DIS algorithm is to apply a classic IS algorithm over a number of equally sized partitions of the training data.…”
Section: The K‐nn Algorithm As a Tool To Transform Big Data Into Smarmentioning
confidence: 99%
“…In particular, we can find approaches based on k‐NN for Big Data such as Peralta et al () where FS is performed on huge datasets using the k‐NN algorithm within an evolutionary approach, or a distributed Spark‐based version of the ReliefF algorithm Palma‐Mendoza, Rodriguez, and de‐Marcos (). In Arnaiz‐González, González‐Rogel, Díez‐Pastor, and López‐Nozal () a parallel implementation of the Democratic IS algorithm (DIS) is presented, called MR‐DIS. The idea of DIS algorithm is to apply a classic IS algorithm over a number of equally sized partitions of the training data.…”
Section: The K‐nn Algorithm As a Tool To Transform Big Data Into Smarmentioning
confidence: 99%
“…In [16] discussed a Democratic Instance Selection (DIS) algorithm for parallel implementation. DIS algorithm achieved less computational complexity, linearity in the number of instances and intuitively parallelized internal configuration.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The observed fact about big data processing is the increased computational complexity because of the high volume [21]. The analysis of big data in supervised classification is based on the learning algorithms, and after that, it finds the appropriate classes for the datasets [22].…”
Section: Introductionmentioning
confidence: 99%