DOI: 10.4018/978-1-61520-971-2.ch002

Towards Data Intensive Many-Task Computing

Abstract: Many-task computing aims to bridge the gap between two computing paradigms: high-throughput computing and high-performance computing. Traditional techniques to support many-task computing commonly found in scientific computing (i.e., the reliance on parallel file systems with static configurations) do not scale to today’s largest systems for data-intensive applications, as the rate of increase in the number of processors per system is outgrowing the rate of performance increase of parallel file systems. In this …

Cited by 21 publications (17 citation statements)
References 38 publications (63 reference statements)
“…FusionFS has already scaled to 1K nodes, and we aim to scale up FusionFS+HyCache to 10K nodes. We will also apply HyCache to Many-Task Computing (MTC) [31][32][33][34], which has specific emphasis on data-intensive computing [35] and cloud computing [36].…”
Section: Discussion (mentioning)
confidence: 99%
“…FusionFS is optimized for a subset of HPC and many-task computing (MTC) [12,59,62,63] workloads, and it is designed for extreme scales [61]. These workloads are often extremely data-intensive [56,58,60], and optimizing data locality [55] becomes critical to achieving good scalability and performance.…”
Section: A FusionFS: Distributed Metadata Management (mentioning)
confidence: 99%
“…Due to the independence of the tasks, BOT applications have been considered most suitably to be executed over widely distributed computational grids. However, we believe that BOT applications should also be able to be executed on other large-scale systems, such as clouds with the help of workflow systems and the Many-Task Computing (MTC) paradigm [7] [45].…”
Section: Introduction (mentioning)
confidence: 99%
“…Tasks can be grouped into BOT within one layer, and be scheduled together in the unit of BOT. A large number of applications [42][44] [45] have been covered with this new programming model, spanning everything from supercomputers [44] to grids [50] and clouds [47], and data-intensive systems [46]. Though there are differences between the MTC-based scientific computing workloads and the initial target workloads of clouds (e.g.…”
Section: Introduction (mentioning)
confidence: 99%