2014 IEEE International Congress on Big Data 2014
DOI: 10.1109/bigdata.congress.2014.137
|View full text |Cite
|
Sign up to set email alerts
|

A Tale of Two Data-Intensive Paradigms: Applications, Abstractions, and Architectures

Abstract: Abstract-Scientific problems that depend on processing large amounts of data require overcoming challenges in multiple areas: managing large-scale data distribution, co-placement and scheduling of data with compute resources, and storing and transferring large volumes of data. We analyze the ecosystems of the two prominent paradigms for data-intensive applications, hereafter referred to as the high-performance computing and the Apache-Hadoop paradigm. We propose a basis, common terminology and functional facto… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
48
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 62 publications
(48 citation statements)
references
References 26 publications
(23 reference statements)
0
48
0
Order By: Relevance
“…It is also worth noting that this workflow is essentially a materials science adaptation of existing similar workflows of data-driven analytics in other domains, as most of the advanced techniques for big data management and informatics come from the field of computer science and more specifically high-performance data mining, [42][43][44][45][46][47][48][49][50] via applications in many different [64][65][66][67][68] and social media analytics, 69-71 among many others.…”
Section: Knowledge Discovery Workflow For Materials Informaticsmentioning
confidence: 99%
“…It is also worth noting that this workflow is essentially a materials science adaptation of existing similar workflows of data-driven analytics in other domains, as most of the advanced techniques for big data management and informatics come from the field of computer science and more specifically high-performance data mining, [42][43][44][45][46][47][48][49][50] via applications in many different [64][65][66][67][68] and social media analytics, 69-71 among many others.…”
Section: Knowledge Discovery Workflow For Materials Informaticsmentioning
confidence: 99%
“…In [8], the authors compare the performance of traditional HPC setups against Hadoop-like frameworks over clusters of commodity hardware with respect to the processing of dataintensive workloads. They also propose a set of Big Data applications as a benchmark to investigate and evaluate the different paradigms.…”
Section: Related Workmentioning
confidence: 99%
“…Request permissions from permissions@acm.org. duce, among others, have already percolated into data intensive computing within HPC [20]. In addition, there are efforts to support traditional HPC-centric scientific computing applications in virtualized cloud infrastructure.…”
Section: Introductionmentioning
confidence: 99%