2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) 2016
DOI: 10.1109/ipdpsw.2016.141
|View full text |Cite
|
Sign up to set email alerts
|

SamzaSQL: Scalable Fast Data Management with Streaming SQL

Abstract: To stay competitive in today's data driven economy, enterprises large and small are turning to stream processing platforms to process high volume, high velocity, and diverse streams of data (fast data) as they arrive. Low-level programming models provided by the popular systems of today suffer from lack of responsiveness to change: enhancements require code changes with attendant large turn-around times. Even though distributed SQL query engines have been available for Big Data, we still lack support for SQL-b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 17 publications
0
6
0
Order By: Relevance
“…Besides, there are more SQL extensions for stream processing scenarios developed for certain systems, e.g., streaming SQL for Apache Kafka called KSQL [57], Continuous Computation Language (CCL) that is the extended SQL used in SAP HANA Smart Data Streaming [58], or SamzaSQL [59] as extended SQL for the DSPS Samza [60].…”
Section: Related Workmentioning
confidence: 99%
“…Besides, there are more SQL extensions for stream processing scenarios developed for certain systems, e.g., streaming SQL for Apache Kafka called KSQL [57], Continuous Computation Language (CCL) that is the extended SQL used in SAP HANA Smart Data Streaming [58], or SamzaSQL [59] as extended SQL for the DSPS Samza [60].…”
Section: Related Workmentioning
confidence: 99%
“…These requirements include the ability to (i) process continuous data on-the-fly without any requirement to store them, (ii) support high-level languages such as SQL, (iii) handle imperfections such as delayed, missing and out-of-order data, (iv) guarantee predictable and repeatable outcomes, (v) efficiently store, access, modify, and combine (with live streaming data) state information, (vi) ensure that the integrity of the data is maintained at all times and relevant applications are up and available despite failures, (vii) automatically and transparently distribute the data processing load across multiple processors and machines, and (viii) respond to high-volume data processing applications in real-time using a highly optimized execution path. Cutting edge DSMEs include Samza-SQL [32], KSQL, and SQLstream Blaze [30], [33].…”
Section: ) Data Stream Processing Layermentioning
confidence: 99%
“…The core of Samza consists of several fairly low-level abstractions, on top of which highlevel operators have been built (Pathirage et al 2016). However, the core abstractions have been carefully designed for operational robustness, and the scalability of Samza is directly attributable to the choice of these foundational abstractions.…”
Section: Overviewmentioning
confidence: 99%