2016
DOI: 10.1016/j.jocs.2016.07.002
|View full text |Cite
|
Sign up to set email alerts
|

Multithreaded and Spark parallelization of feature selection filters

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
20
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 35 publications
(20 citation statements)
references
References 6 publications
0
20
0
Order By: Relevance
“…The method seeks to identify and delete as much as possible irreparable and redundant information. It simplifies the data set both in size and in complexity of understanding, which leads to simpler and faster classification algorithms, better problem comprehension and reduced storage requirements (Eiras‐Franco et al, ; Phyu & Oo, ).…”
Section: Methodsmentioning
confidence: 99%
“…The method seeks to identify and delete as much as possible irreparable and redundant information. It simplifies the data set both in size and in complexity of understanding, which leads to simpler and faster classification algorithms, better problem comprehension and reduced storage requirements (Eiras‐Franco et al, ; Phyu & Oo, ).…”
Section: Methodsmentioning
confidence: 99%
“…The experiments tested and compared time-efficiency and scalability for the horizontal and vertical DiCFS approaches so as to check whether they improved on the original non-distributed version of the CFS. We also tested and compared execution times with that reported in the recently published research by Eiras-Franco et al [10] into a distributed version of CFS for regression problems.…”
Section: Methodsmentioning
confidence: 99%
“…We compare the two versions -DiCFS-hp and DiCFSvp, respectively -and also compare them with a baseline, represented by the classical non-distributed implementation of CFS in WEKA [17]. Finally, their benefits in terms of reduced execution time are compared with those of the CFS version developed by Eiras-Fanco et al [10] for regression problems. The results show that the time-efficiency and scalability of our two versions are an improvement on those of the original version of the CFS; furthermore, similar or improved execution times are obtained with respect to the Eiras-Franco et al [10] regression version.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations