Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing 2010
DOI: 10.1145/1851476.1851583
|View full text |Cite
|
Sign up to set email alerts
|

Towards optimising distributed data streaming graphs using parallel streams

Abstract: Modern scientific collaborations have opened up the opportunity of solving complex problems that involve multidisciplinary expertise and large-scale computational experiments. These experiments usually involve large amounts of data that are located in distributed data repositories running various software systems, and managed by different organisations. A common strategy to make the experiments more manageable is executing the processing steps as a workflow. In this paper, we look into the implementation of fi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…Thus, a good optimization choice is to avoid assigning these PEIs on separate DIVMs unless the competition for the CPU or RAM outweighs the transfer cost. We have achieved a linear speed-up in optimizing this workflow, as reported in Liew et al [2] and Han et al [9].…”
Section: Demonstration Of Performance Database Usagementioning
confidence: 63%
“…Thus, a good optimization choice is to avoid assigning these PEIs on separate DIVMs unless the competition for the CPU or RAM outweighs the transfer cost. We have achieved a linear speed-up in optimizing this workflow, as reported in Liew et al [2] and Han et al [9].…”
Section: Demonstration Of Performance Database Usagementioning
confidence: 63%
“…Doing so would bring immediate benefits in terms of reproducibility, but also in terms of standardization of components and interfaces. 38,60,61 There are many examples of stream [62][63][64] and event 41,65,66 processing frameworks, and recent works have combined streaming with more traditional file-based workflows. 67,68 Applications such as the marine sensor network (Section 2.6), distributed network intrusion detection (Section 2.13), power grids (Section 2.9), and industrial incident notification and response (Section 2.11) use stream and event processing abstractions.…”
Section: Examples Of Specific Cyberinfrastructure Usagementioning
confidence: 99%
“…Besid es, to achieve high energy efficiency and low response tim e in big d ata stream com pu ting environm ents, the au thors of [2] and [3] propose a real-tim e and energy-efficient resou rce sched u ling and optim ization fram ew ork, term ed the Re-Stream , w hich aid s in calcu lating the energy consu m ption of a resou rce allocation schem e for a d ata stream graph. What is m ore, a partitioning-based d ataintensive w orkflow optim ization algorithm [37], [39] has been proposed to provid e significantly red u ced latency w ith increase in the throu ghpu t. H ow ever, the issu e of VM allocation has not been properly ad d ressed in geod istribu ted DCs for stream ing w orkflow. Only a sm all nu m ber of w orks ad d ress VM allocation in clou d com p uting system s for stream ing big d ata processing.…”
Section: Streaming Workflow Optimizationmentioning
confidence: 99%