2017 IEEE International Conference on Big Data (Big Data) 2017
DOI: 10.1109/bigdata.2017.8258196
|View full text |Cite
|
Sign up to set email alerts
|

Towards a unified storage and ingestion architecture for stream processing

Abstract: Abstract-Big Data applications are rapidly moving from a batch-oriented execution model to a streaming execution model in order to extract value from the data in real-time. However, processing live data alone is often not enough: in many cases, such applications need to combine the live data with previously archived data to increase the quality of the extracted insights. Current streaming-oriented runtimes and middlewares are not flexible enough to deal with this trend, as they address ingestion (collection an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 24 publications
0
9
0
Order By: Relevance
“…With the wide spread of big data technology this highspeed data processing become more popular, but there are limitations on bandwidth and latency for real time or near real time processing. That is why there are some gaps between an analytics system and a real end-to-end system (Wang et al, 2017;Marcu et al, 2017;Petrov&Valov, 2019).…”
Section: Creating a Data Model With The Analytical Tool Apache Kudumentioning
confidence: 99%
“…With the wide spread of big data technology this highspeed data processing become more popular, but there are limitations on bandwidth and latency for real time or near real time processing. That is why there are some gaps between an analytics system and a real end-to-end system (Wang et al, 2017;Marcu et al, 2017;Petrov&Valov, 2019).…”
Section: Creating a Data Model With The Analytical Tool Apache Kudumentioning
confidence: 99%
“…Failures can cause the loss of large amounts of data streams which may lead to erroneous analytic results. Ingestion systems should incorporate high-availability mechanisms that allow to operate continuously for a long time in spite of failures [5]. It is, therefore, desirable for ingestion systems to offer the desired degree of robustness in handling failures while minimizing data loss [7].…”
Section: B Fault Tolerance and Message Delivery Guaranteesmentioning
confidence: 99%
“…A typical streaming analytics system is built on top of a three layers stack that include ingestion, processing, and storage components [5]. The ingestion layer is the entry point to the streaming architecture.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…None of the state-of-art ingestion systems are designed to leverage data locality optimizations as envisioned with KerA in a unified storage and ingestion architecture [19]. Moreover, thanks to its network agnostic implementation [7], KerA can benefit from emerging fast networks and RDMA, providing more efficient reads and writes than using TCP/IP.…”
Section: Related Workmentioning
confidence: 99%