2018 IEEE 34th International Conference on Data Engineering (ICDE) 2018
DOI: 10.1109/icde.2018.00135
|View full text |Cite
|
Sign up to set email alerts
|

Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing

Abstract: Computing aggregates over windows is at the core of virtually every stream processing job. Typical stream processing applications involve overlapping windows and, therefore, cause redundant computations. Several techniques prevent this redundancy by sharing partial aggregates among windows. However, these techniques do not support out-of-order processing and session windows. Out-of-order processing is a key requirement to deal with delayed tuples in case of source failures such as temporary sensor outages. Ses… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 32 publications
(23 citation statements)
references
References 14 publications
0
23
0
Order By: Relevance
“…The insert(t, v) operation supports the case where t is already in the window, so it works with pre-aggregation schemes such as window panes [23], paired windows [22], cutty windows [13], or Scotty [30]. For instance, for a 5-hour sliding window that advances in 1-minute increments, the logical times can be rounded to minutes, leading to more cases where t is already in the window.…”
Section: Problem Statement: Ooo Swagmentioning
confidence: 99%
See 1 more Smart Citation
“…The insert(t, v) operation supports the case where t is already in the window, so it works with pre-aggregation schemes such as window panes [23], paired windows [22], cutty windows [13], or Scotty [30]. For instance, for a 5-hour sliding window that advances in 1-minute increments, the logical times can be rounded to minutes, leading to more cases where t is already in the window.…”
Section: Problem Statement: Ooo Swagmentioning
confidence: 99%
“…All of the following papers focus on sharing over streams with the same aggregation operator, e.g., monoid (S, ⊗, 1). The Scotty algorithm supports sliding-window aggregation over out-oforder streams, while sharing windows with both different sizes and slice granularities [30]. For instance, Scotty might share a window of size 60 minutes and granularity 3 minutes with a session window whose gap timeout is set to 5 minutes.…”
Section: Related Workmentioning
confidence: 99%
“…Another future direction is to extend slicing capabilities beyond deterministic windows (if possible) and cover cases of fully data-driven windows without FIFO guarantees such as ADWIN (Bifet and Gavalda 2007). Finally, general pre-aggregation data structures have to employ the notion of out-of-orderness (Traub et al 2018). Currently, with existing out-of-the-box solution such as FlatFat it is not possible to retract already evaluated window aggregates, thus, making it impossible to use for systems like Beam and Flink with out-of-order logic.…”
Section: Future Directionsmentioning
confidence: 99%
“…Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s [3,8,10,11]. To evaluate these techniques and the impact of out-of-order handling on the system performance, it is necessary to modify real-world data sets in order to meet specific data characteristics such as out-of-orderness and the addition of delay in event arrivals.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we present an out-of-order stream data generator, which enables the reproducible modification of out-of-order characteristics of arbitrary input data sets. This tool was already used in recent work on efficient window aggregation with General Stream Slicing [10,11] and is available in a public repository. 1 The data generator supports benchmark developers in two aspects.…”
Section: Introductionmentioning
confidence: 99%