2020
DOI: 10.1007/978-3-030-46939-9_6
|View full text |Cite
|
Sign up to set email alerts
|

Performance Analysis of Apache Spark MLlib Clustering on Batch Data Stored in Cassandra

Abstract: With the tremendous increase in the amount of data being generated from variety of sources there is a need of efficient data storage and processing techniques. Some of the sources generating this large amount of data are Weather Sensors, Scientific experiments, etc. This huge voluminous data is termed as BigData. Due to ever-increasing amount of data there is a demand for faster data ingestion and processing. Apache Spark, a dominant processing tool is a publicly available platform for processing outsized data… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 5 publications
(5 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?