2020
DOI: 10.1007/978-981-15-2071-6_34
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Study of Clustering Algorithms for Big Data Mining with MapReduce Capability

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…Some authors (Fahad et al. , 2014; Pandey et al. , 2020) defined the clustering algorithm characteristics for big data using volume, variety and velocity characteristics.…”
Section: Related Work Of Initialization Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Some authors (Fahad et al. , 2014; Pandey et al. , 2020) defined the clustering algorithm characteristics for big data using volume, variety and velocity characteristics.…”
Section: Related Work Of Initialization Methodsmentioning
confidence: 99%
“…Some authors (Fahad et al, 2014;Pandey et al, 2020) defined the clustering algorithm characteristics for big data using volume, variety and velocity characteristics. The volume Clustering for big data mining characteristic defines the scale, outliers and dimensionality of the dataset; variety specifies the data forms, cluster structure and shape; and velocity represents the computing efficiency through time complexity and scalability of the clustering algorithm.…”
Section: Analysis Of Existing Initialization Algorithm For Big Data C...mentioning
confidence: 99%
“…The volume, variety and velocity characteristics were utilized by the literature 2,69,70 to define the clustering algorithm characteristics for big data. The volume characteristic dictates the size, dimensionality and outliers of the data.…”
Section: Related Workmentioning
confidence: 99%
“…Janghel et al (Pandey et al, 2020) focused on correct diagnosing of breast cancer from the UCI repository dataset, by employing thirteen machine learning models and comparing them to the various measures. The results showed that AdaBoost, LR, and KNN models were promising high accuracy of 98% in experimenting with all the models.…”
Section: Breast Cancer Diseasementioning
confidence: 99%