SPL

Hirzel, Martin; Schneider, Scott; Gedik, Buğra

doi:10.1145/3039207

Cited by 17 publications

(4 citation statements)

References 78 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Our implementation does not directly use SIMD instructions, but the C++ optimizing compiler sometimes uses them automatically. We did not implement partitioning but it is straightforward: when the aggregate is partitioned by key, keep disjoint state, i.e., a separate tree for each key; that would enable fission [12] for parallelization, either user-directed or automatically. Previous work describes an algorithm for range queries [22], and that algorithm also works in the presence of bulk insertion and eviction.…”

Section: Methodsmentioning

confidence: 99%

“…In theory, we expect that since the data is in-order, bulk insert brings no additional advantage over looping over single inserts. In practice, all algorithms improve in throughput as 𝑚 increases from 2 0 to around 2 12 . This may be because fewer top-level insertions means fewer memory fences, even for algorithms that emulate bulk insert with loops.…”

Section: Throughputmentioning

confidence: 96%

See 1 more Smart Citation

Out-of-Order Sliding-Window Aggregation with Efficient Bulk Evictions and Insertions

Tangwongsan,

Hirzel,

Schneider

2023

Proc. VLDB Endow.

Self Cite

View full text Add to dashboard Cite

Sliding-window aggregation is a foundational stream processing primitive that efficiently summarizes recent data. The state-of-the-art algorithms for sliding-window aggregation are highly efficient when stream data items are evicted or inserted one at a time, even when some of the insertions occur out-of-order. However, real-world streams are often not only out-of-order but also bursty, causing data items to be evicted or inserted in larger bulks. This paper introduces a new algorithm for sliding-window aggregation with bulk eviction and bulk insertion. For the special case of single insert and evict, our algorithm matches the theoretical complexity of the best previous out-of-order algorithms. For the case of bulk evict, our algorithm improves upon the theoretical complexity of the best previous algorithm for that case and also outperforms it in practice. For the case of bulk insert, there are no prior algorithms, and our algorithm improves upon the naive approach of emulating bulk insert with a loop over single inserts, both in theory and in practice. Overall, this paper makes high-performance algorithms for sliding window aggregation more broadly applicable by efficiently handling the ubiquitous cases of out-of-order data and bursts.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Throughputmentioning

confidence: 96%

Out-of-Order Sliding-Window Aggregation with Efficient Bulk Evictions and Insertions

Tangwongsan,

Hirzel,

Schneider

2023

Proc. VLDB Endow.

Self Cite

View full text Add to dashboard Cite

show abstract

“…Architectural models [42], SPEs [43,13], and engines for certain application scenarios such as IoT are emerging. Architecture that mixes elements deployed on edge computing resources and the cloud is provided in the literature [43,44,42].…”

Section: Second: Distributed Executionmentioning

confidence: 99%

“…Stream Processing Language (SPL) offers a language and engine for composing distributed and parallel data-flow graphs and a toolkit for building generic operators [44]. It provides language constructs and compiler optimisations that utilise the performance of the Stream Processing Core (SPC) [68].…”

Section: Other Solutionsmentioning

confidence: 99%

Distributed data stream processing and edge computing: A survey on resource elasticity and future directions

Assuno

Veith

Buyya

2018

Journal of Network and Computer Applications

254

View full text Add to dashboard Cite

Under several emerging application scenarios, such as in smart cities, operational monitoring of large infrastructure, wearable assistance, and Internet of Things, continuous data streams must be processed under very short delays. Several solutions, including multiple software engines, have been developed for processing unbounded data streams in a scalable and efficient manner. More recently, architecture has been proposed to use edge computing for data stream processing. This paper surveys state of the art on stream processing engines and mechanisms for exploiting resource elasticity features of cloud computing in stream processing. Resource elasticity allows for an application or service to scale out/in according to fluctuating demands. Although such features have been extensively investigated for enterprise applications, stream processing poses challenges on achieving elastic systems that can make efficient resource management decisions based on current load. Elasticity becomes even more challenging in highly distributed environments comprising edge and cloud computing resources. This work examines some of these challenges and discusses solutions proposed in the literature to address them.

show abstract

StreamB: A Declarative Language for Automatically Processing Data Streams in Abstract Environments for Agent Platforms

Ferrando

Papacchini

2022

Engineering Multi-Agent Systems

View full text Add to dashboard Cite

SPL

Cited by 17 publications

References 78 publications

Out-of-Order Sliding-Window Aggregation with Efficient Bulk Evictions and Insertions

Out-of-Order Sliding-Window Aggregation with Efficient Bulk Evictions and Insertions

Distributed data stream processing and edge computing: A survey on resource elasticity and future directions

StreamB: A Declarative Language for Automatically Processing Data Streams in Abstract Environments for Agent Platforms

Contact Info

Product

Resources

About