2019
DOI: 10.1002/cpe.5189
|View full text |Cite
|
Sign up to set email alerts
|

Twister2: Design of a big data toolkit

Abstract: Data-driven applications are essential to handle the ever-increasing volume, velocity, and veracity of data generated by sources such as the Web and Internet of Things (IoT) devices. Simultaneously, an event-driven computational paradigm is emerging as the core of modern systems designed for database queries, data analytics, and on-demand applications. Modern big data processing runtimes and asynchronous many task (AMT) systems from high performance computing (HPC) community have adopted dataflow event-driven … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
17
0
1

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 16 publications
(19 citation statements)
references
References 69 publications
0
17
0
1
Order By: Relevance
“…The integration presented here 1 provides APIs in Python and Java. We chose those two languages because Java is the native language for COMPSs and HDFS, while Python popular in Data Science scenarioes, and it is required for the Lemonade environment, which will be described later.…”
Section: Data Abstractionmentioning
confidence: 99%
See 3 more Smart Citations
“…The integration presented here 1 provides APIs in Python and Java. We chose those two languages because Java is the native language for COMPSs and HDFS, while Python popular in Data Science scenarioes, and it is required for the Lemonade environment, which will be described later.…”
Section: Data Abstractionmentioning
confidence: 99%
“…Using HDFS, each fragment can be read in parallel by the multiple instances of the task (task1). From there, the next steps are similar to the existing solutions in COMPSs programming, that 1 https://github.com/eubr-bigsea/compss-hdfs Algorithm 1: COMPSs HDFS API usage example.…”
Section: Data Abstractionmentioning
confidence: 99%
See 2 more Smart Citations
“…Já em cenários convencionalmente denominados de Big Data, que buscam processar grandes volumes de dados de diversos tipos, normalmente não estruturados, utilizam hardware convencional e se valem fortemente de técnicas de paralelismo de dados. Nesse caso, os dados podem ser processados como fluxos individuais e analisados coletivamente em stream ou em lote, para a descoberta de conhecimento, sendo a mineração em Big Data uma das tarefas chaves em muitos domínios da ciência [Kamburugamuve et al 2017].…”
Section: Introductionunclassified