Proceedings of the 16th Annual Middleware Conference 2015
DOI: 10.1145/2814576.2814808
|View full text |Cite
|
Sign up to set email alerts
|

R-Storm

Abstract: The era of big data has led to the emergence of new systems for real-time distributed stream processing, e.g., Apache Storm is one of the most popular stream processing systems in industry today. However, Storm, like many other stream processing systems lacks an intelligent scheduling mechanism. The default round-robin scheduling currently deployed in Storm disregards resource demands and availability, and can therefore be inefficient at times. We present R-Storm (Resource-Aware Storm), a system that implement… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0
3

Year Published

2016
2016
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 161 publications
(31 citation statements)
references
References 21 publications
0
23
0
3
Order By: Relevance
“…Several approaches for optimising the initial task assignment or scheduling exploit intra-query parallelism by ensuring that certain operators can scale horizontally to support larger numbers of incoming tuples, thus achieving greater throughput. R-Storm [82] handles the problem of task assignment in Apache Storm by providing custom resourceaware scheduling schemes. Under the considered approach, each task in a Storm topology has soft CPU and bandwidth requirements and a hard memory requirement.…”
Section: Static Techniquesmentioning
confidence: 99%
“…Several approaches for optimising the initial task assignment or scheduling exploit intra-query parallelism by ensuring that certain operators can scale horizontally to support larger numbers of incoming tuples, thus achieving greater throughput. R-Storm [82] handles the problem of task assignment in Apache Storm by providing custom resourceaware scheduling schemes. Under the considered approach, each task in a Storm topology has soft CPU and bandwidth requirements and a hard memory requirement.…”
Section: Static Techniquesmentioning
confidence: 99%
“…T-Storm [13] requires that each node has only one available worker for each topology to avoid the inter-worker traffic inside one node which is only suitable for lightly loaded topologies. There is a fair number of research work on addressing heterogeneity in terms of resource and input rate in DSP [9,12,33,34]. For instance, Buddhika et al [33] introduce an on-line scheduler to mitigate the impact of interference on the performance of DSP.…”
Section: B Operator Placement and Task Scheduling In Dspmentioning
confidence: 99%
“…There are two primary yet complementary ways to optimize inter-operator communications: (i) reduce the cost of intra-node IPC and (ii) reduce the inter-node IPC traffic. Prior efforts have focused mainly on reducing the inter-node IPC traffic by scheduling highly communicating operator instances to workers in the same node [9][10][11][12]. However, they may encounter high latency due to the high cost of intranode IPC, resulting from the memory-to-memory copying inside a cluster node.…”
Section: Introductionmentioning
confidence: 99%
“…L'interrogation de ces flux via des requĂȘtes, dites continues 14 , reprĂ©sente un dĂ©fi majeur en termes de performance pour le traitement en temps rĂ©el : passage Ă  l'Ă©chelle pour la gestion de flux massifs et robustesse pour permettre le traitement continu des donnĂ©es. Afin de rĂ©pondre Ă  ces enjeux, des systĂšmes de gestion de flux de donnĂ©es (Schneider et al, 2009 ;Peng et al, 2015 ;Xu, Peng, 2016) ont Ă©tĂ© dĂ©veloppĂ©s. Dans la suite de cette section, nous nous intĂ©ressons aux approches qui permettent l'exĂ©cution de requĂȘtes continues de maniĂšre parallĂšle et distribuĂ©e.…”
Section: Contexteunclassified