2016
DOI: 10.1109/tbdata.2016.2622288
|View full text |Cite
|
Sign up to set email alerts
|

Fuzzy Based Scalable Clustering Algorithms for Handling Big Data Using Apache Spark

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
22
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 59 publications
(22 citation statements)
references
References 25 publications
0
22
0
Order By: Relevance
“…The space complexity of Fast Kernel Matrix Computation was higher. The Scalable Random Sampling with Iterative Optimization Fuzzy c-Means algorithm (SRSIO-FCM) was introduced in [15] to address the challenges involved during big data clustering. The clustering performance of SRSIO-FCM was not efficient therefore lacks clustering accuracy.…”
Section: Related Workmentioning
confidence: 99%
“…The space complexity of Fast Kernel Matrix Computation was higher. The Scalable Random Sampling with Iterative Optimization Fuzzy c-Means algorithm (SRSIO-FCM) was introduced in [15] to address the challenges involved during big data clustering. The clustering performance of SRSIO-FCM was not efficient therefore lacks clustering accuracy.…”
Section: Related Workmentioning
confidence: 99%
“…K. Peng et al [17] has proposed a clustering strategy for IDS dependent on Mini Batch K-means joined with important part examination. Initial, a pre-processing technique was proposed to digitize the strings and afterward the informational index was standardized in order to improve the clustering proficiency.…”
Section: Related Workmentioning
confidence: 99%
“…In‐memory processes enhance the effectiveness of program execution that the performance is superior to offline storage . However, in‐memory processing performance has scope for improvement; for example, the execution time of a certain type of instruction, such as reduceByKey() and groupByKey() in join operations, is longer than for general instructions . The data are sorted among offline storages depending on the time consumed.…”
Section: Introductionmentioning
confidence: 99%
“…The differences between Spark and Hadoop in intermediate data buffer result in high performance of iterative applications and interactive data mining with Spark. 17,25 Dharanipragada et al proposed Generate-Map-Reduce (GMR), which was an extension to MapReduce, to support iterative jobs and a distributed communication model by using shared data structures. GMR captured recursive computations by modeling iterative applications, such as simulated annealing and A* search.…”
mentioning
confidence: 99%