Adaptive Stream Processing using Dynamic Batch Sizing

Das, Tamal; Zhong, Yang; Stoica, Ion; Shenker, Scott

doi:10.1145/2670979.2670995

Cited by 101 publications

(78 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Lohrmann et al [37] adaptively adjusted the buffer sizes and performs task chaining according to the QoS constraints. Das et al [23] used dynamic batch sizing for stream processing in Apache Spark to avoid queuing delays as the input data rate changes. Venkataraman et al [50] dynamically adjusted the number of batches that were grouped together for scheduling.…”

Section: Related Workmentioning

confidence: 99%

Towards automatic parameter tuning of stream processing systems

Bilal

Canini

2017

Proceedings of the 2017 Symposium on Cloud Computing

View full text Add to dashboard Cite

CitationBilal ABSTRACTOptimizing the performance of big-data streaming applications has become a daunting and time-consuming task: parameters may be tuned from a space of hundreds or even thousands of possible configurations. In this paper, we present a framework for automating parameter tuning for stream-processing systems. Our framework supports standard black-box optimization algorithms as well as a novel gray-box optimization algorithm. We demonstrate the multiple benefits of automated parameter tuning in optimizing three benchmark applications in Apache Storm. Our results show that a hill-climbing algorithm that uses a new heuristic sampling approach based on Latin Hypercube provides the best results. Our gray-box algorithm provides comparable results while being two to five times faster.

show abstract

Section: Related Workmentioning

confidence: 99%

Towards automatic parameter tuning of stream processing systems

Bilal

Canini

2017

Proceedings of the 2017 Symposium on Cloud Computing

View full text Add to dashboard Cite

show abstract

“…Storm [66] uses Zookeeper [6] to coordinate backpressure across nodes. Das et al [19] propose dynamically adjusting batch sizes to improve latency and throughput. Contrary to these approaches, Wisp's rate limiting approach does not assume knowledge of the full service topology, dynamically computes rate limits based on measured resource utilization instead of static thresholds, is multi-tenant aware, and does not require centralized coordination.…”

Section: Related Workmentioning

confidence: 99%

Distributed resource management across process boundaries

Suresh

Bodík

Menache

et al. 2017

Proceedings of the 2017 Symposium on Cloud Computing

View full text Add to dashboard Cite

Multi-tenant distributed systems composed of small services, such as Service-oriented Architectures (SOAs) and Micro-services, raise new challenges in attaining high performance and efficient resource utilization. In these systems, a request execution spans tens to thousands of processes, and the execution paths and resource demands on different services are generally not known when a request first enters the system. In this paper, we highlight the fundamental challenges of regulating load and scheduling in SOAs while meeting end-to-end performance objectives on metrics of concern to both tenants and operators. We design Wisp, a framework for building SOAs that transparently adapts rate limiters and request schedulers systemwide according to operator policies to satisfy end-to-end goals while responding to changing system conditions. In evaluations against production as well as synthetic workloads, Wisp successfully enforces a range of end-to-end performance objectives, such as reducing average latencies, meeting deadlines, providing fairness and isolation, and avoiding system overload.

show abstract

“…foreach Oi do 6 Let p ik be the node in Si that has the largest OF increase δ ik ; 7 P ← P ∪ {p ik }; Si ← Si − {p ik }; 8 usage = N ; 9 if P = ∅ & N > R then return P; 10 while usage < R do 11 Candidates ← ∅; 12 foreach Oi do 13 Let p ik be the node in Si that has the largest OF increase δ ik ;…”

Section: Algorithm 4: Planfulltopology(p R T )mentioning

confidence: 99%

“…Input: The amount of available resources R; Topology T ; Output: Partial replication plan P; 1 Initialize: decompose the complete topology T into sub-topologies: T S1, T S2, ... ; 2 P ← ∅, SA ← ∅, usage ← 0; 3 if R < Number of operators in T then 4 Return P ; 5 foreach Sub-Topology T Si do 6 Ni ← Number of operators in T Si;…”

Section: Algorithm 5: Structureaware(rt )mentioning

confidence: 99%

Tolerating correlated failures in Massively Parallel Stream Processing Engines

Zhou

2016

2016 IEEE 32nd International Conference on Data Engineering (ICDE)

View full text Add to dashboard Cite

Abstract-Fault-tolerance techniques for stream processing engines can be categorized into passive and active approaches. A typical passive approach periodically checkpoints a processing task's runtime states and can recover a failed task by restoring its runtime state using its latest checkpoint. On the other hand, an active approach usually employs backup nodes to run replicated tasks. Upon failure, the active replica can take over the processing of the failed task with minimal latency. However, both approaches have their own inadequacies in Massively Parallel Stream Processing Engines (MPSPE). The passive approach incurs a long recovery latency especially when a number of correlated nodes fail simultaneously, while the active approach requires extra replication resources. In this paper, we propose a new faulttolerance framework, which is Passive and Partially Active (PPA). In a PPA scheme, the passive approach is applied to all tasks while only a selected set of tasks will be actively replicated. The number of actively replicated tasks depends on the available resources. If tasks without active replicas fail, tentative outputs will be generated before the completion of the recovery process. We also propose effective and efficient algorithms to optimize a partially active replication plan to maximize the quality of tentative outputs. We implemented PPA on top of Storm, an open-source MPSPE and conducted extensive experiments using both real and synthetic datasets to verify the effectiveness of our approach.

show abstract

Adaptive Stream Processing using Dynamic Batch Sizing

Cited by 101 publications

References 19 publications

Towards automatic parameter tuning of stream processing systems

Towards automatic parameter tuning of stream processing systems

Distributed resource management across process boundaries

Tolerating correlated failures in Massively Parallel Stream Processing Engines

Contact Info

Product

Resources

About