2019 International Conference on High Performance Big Data and Intelligent Systems (HPBD&IS) 2019
DOI: 10.1109/hpbdis.2019.8735495

Twister2: TSet High-Performance Iterative Dataflow

Citations: cited by 17 publications (24 citation statements)
References: 14 publications
“…The dataflow model is a trend in Big Data applications [24]. There are several flow-based programming frameworks often defining applications as networks of "black box" processes, which exchange data through predefined inputs and outputs.…”
Section: Related Work (mentioning)
confidence: 99%
“…There are several flow-based programming frameworks often defining applications as networks of "black box" processes, which exchange data through predefined inputs and outputs. Those frameworks use different ways to define a flow: by using a functional-based language, such as Apache Spark and Twister2 [24]; by using skeleton-based pipelines, such as Ruffus [25] and Cosmos [26]; or even visually, such as RapidMiner [27], Orange [28] and KNIME [29].…”
Section: Related Work (mentioning)
confidence: 99%
“…While dataflow is a prevalent model in many parallel and distributed programming frameworks [4], functional programming is slowly becoming a common interface. In addition to Big Data frameworks like Spark, Flink and Swift [9], functional interfaces are also being frequently used in other Data Sciences programming tools (e.g., Scikit-Learn and Pandas use it to express their dataflow models).…”
Section: Related Work (mentioning)
confidence: 99%
“…Convergence between high-performance computing (HPC) and Big Data has become an important research area, driven in part by the need to incorporate high-level libraries, platforms, and algorithms for machine learning and graph processing, and in part by the idea of using Big Data's fine-grained data awareness to increase the productivity of HPC systems [1], [2]. Several proposals of higher-level abstractions have emerged to address the requirements of these two areas in computer systems [3], [4]. Recent frameworks, like COMPSs [3], Twister2 [4], Spark [5] and Flink [6], share a common dataflow programming model, but are still focused on a single area.…”
Section: Introduction (mentioning)
confidence: 99%