2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST) 2010
DOI: 10.1109/msst.2010.5496977
|View full text |Cite
|
Sign up to set email alerts
|

Scalable storage support for data stream processing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2011
2011
2014
2014

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…As a result, Big Data analysis necessitates tremendously time-consuming navigation through a gigantic search space to provide guidelines and obtain feedback from users. Thus, Sebepou and Magoutis [ 87 ] proposed a scalable system of data streaming with a persistent storage path. This path influences the performance properties of a scalable streaming system slightly.…”
Section: Life Cycle and Management Of Data Using Technologies And mentioning
confidence: 99%
“…As a result, Big Data analysis necessitates tremendously time-consuming navigation through a gigantic search space to provide guidelines and obtain feedback from users. Thus, Sebepou and Magoutis [ 87 ] proposed a scalable system of data streaming with a persistent storage path. This path influences the performance properties of a scalable streaming system slightly.…”
Section: Life Cycle and Management Of Data Using Technologies And mentioning
confidence: 99%
“…the information about the point in time from where to replay tuples in case of failure) is not maintained mixing regular output tuples with checkpoint tuples but rather using output tuples header (finer granularity and lower overhead); (2) the earliest timestamp in maintained on-line in StreamCloud, avoiding thus unnecessary read operations in the parallel file system to retrieve its value and finally, (3) [SM11] do not consider dynamic setups (elasticity) nor stateful operators garbage collection mechanisms. The authors of [SM11] leverage previous work [SM10] on how to efficiently persist a stream connecting two data streaming operators relying on a parallel file system. StreamCloud leverages and improves on this work by providing a better way to persist streams adopting a self-identifying naming convention for the persisted information; thus avoiding metadata maintenance as in [SM10] and reducing the runtime protocol impact (about 20ms in the proposed work to approximately 1ms in StreamCloud, as presented in 5.6.2).…”
Section: Fault Tolerance Techniquesmentioning
confidence: 99%
“…The authors of [SM11] leverage previous work [SM10] on how to efficiently persist a stream connecting two data streaming operators relying on a parallel file system. StreamCloud leverages and improves on this work by providing a better way to persist streams adopting a self-identifying naming convention for the persisted information; thus avoiding metadata maintenance as in [SM10] and reducing the runtime protocol impact (about 20ms in the proposed work to approximately 1ms in StreamCloud, as presented in 5.6.2).…”
Section: Fault Tolerance Techniquesmentioning
confidence: 99%