2017
DOI: 10.1016/j.jnca.2017.08.011
|View full text |Cite
|
Sign up to set email alerts
|

Cloud storage reliability for Big Data applications: A state of the art survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 97 publications
(47 citation statements)
references
References 52 publications
0
44
0
Order By: Relevance
“…Both techniques have their own trade-offs in various parameters such as durability, availability, storage overhead, network bandwidth and traffic, energy consumption and recovery performance. Future research should include the challenges involved in employing both techniques in Cloud storage systems for Big Data applications with respect to the aforementioned parameters [163]. This hybrid technique applies proactive dynamic data replication of erasure coded data based on node failure prediction, which significantly reduces network traffic and improves the performance of Big Data applications with less storage overhead.…”
Section: Reliabilitymentioning
confidence: 99%
“…Both techniques have their own trade-offs in various parameters such as durability, availability, storage overhead, network bandwidth and traffic, energy consumption and recovery performance. Future research should include the challenges involved in employing both techniques in Cloud storage systems for Big Data applications with respect to the aforementioned parameters [163]. This hybrid technique applies proactive dynamic data replication of erasure coded data based on node failure prediction, which significantly reduces network traffic and improves the performance of Big Data applications with less storage overhead.…”
Section: Reliabilitymentioning
confidence: 99%
“…RDDs achieve fault tolerance through the notion of lineage. Each RDD tracks the graph of transformations that was used to build it and reruns these operations on base data to reconstruct any lost partitions [25]. The other key concept in Spark is its DAG execution engine, which is similar to Tez and is our basis for extending the Tez model to Spark.…”
Section: System Architecture and Application Structurementioning
confidence: 99%
“…Reliability is defined in the context of resource failure, in the context of VM failure, in the context of service failure, or in the context of security. Nachiappan et al [25] have used reliability for cloud storage scheduling in big data. From the existing studies, very few reliability modelling is proposed for a cloud-based system.…”
Section: Related Workmentioning
confidence: 99%