2016
DOI: 10.1109/tsc.2015.2444838
|View full text |Cite
|
Sign up to set email alerts
|

Processing Cassandra Datasets with Hadoop-Streaming Based Approaches

Abstract: The progressive transition in the nature of both scientific and industrial datasets has been the driving force behind the development and research interests in the NoSQL model. Loosely structured data poses a challenge to traditional data store systems, and when working with the NoSQL model, these systems are often considered impractical and costly. As the quantity and quality of unstructured data grows, so does the demand for a processing pipeline that is capable of seamlessly combining the NoSQL storage mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 33 publications
(15 citation statements)
references
References 21 publications
0
15
0
Order By: Relevance
“…Another similar approach was presented in [6] and focused on reducing network overhead. To deal with the storage and processing of large media databases, MapReduce was applied along with a Java-based framework for Hadoop streaming in [8]. Cloud platform was used to efficiently deal with video processing in [14], [15].…”
Section: A Multimedia Managementmentioning
confidence: 99%
See 2 more Smart Citations
“…Another similar approach was presented in [6] and focused on reducing network overhead. To deal with the storage and processing of large media databases, MapReduce was applied along with a Java-based framework for Hadoop streaming in [8]. Cloud platform was used to efficiently deal with video processing in [14], [15].…”
Section: A Multimedia Managementmentioning
confidence: 99%
“…Cloud platform was used to efficiently deal with video processing in [14], [15]. All of the approaches presented in [5], [6], [8], [14], [15] focused on the cloud side, i.e., the server side and ignored the packet loss problems.…”
Section: A Multimedia Managementmentioning
confidence: 99%
See 1 more Smart Citation
“…Once placed in HDFS, the data can be used by Hadoop applications. Flume: Apache Flume is a extremely reliable service for accurate information collection and transition from autonomous computers to HDFS [11]. A number of flume agents that can cross a sequence of computers and places are often involved in information transportation.…”
Section: Data Cleaning and Data Injectionmentioning
confidence: 99%
“…Big Data is also associated with the new forms of digital data generated from social media and learning management systems offer researchers enormous opportunities for extracting meaning (Dede et al, 2016;Doorn, 2014; Lazer, Kennedy, King, and Vespignani, 2014) [33][34][35].…”
Section: Big Data Analytics and Data Sciencementioning
confidence: 99%