2015
DOI: 10.1016/j.bdr.2015.01.007
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Collective I/O Scheduling for High-Performance Computing

Abstract: The non-contiguous access pattern of many scientific applications results in a large number of I/O requests, which can seriously limit the data-access performance. Collective I/O has been widely used to address this issue. However, the performance of collective I/O could be dramatically degraded in today's high-performance computing systems due to the increasing shuffle cost caused by highly concurrent data accesses. This situation tends to be even worse as many applications become more and more data intensive… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 31 publications
(58 reference statements)
0
3
0
Order By: Relevance
“…A typical practice is that each developer may focus on only fixing one single failing test case where the passing test cases are utilized. The idea of parallel is pretty popular is the area of big data and I/O [15] [16]. Although some techniques, including test cases clustering, are possibly adoptable in parallel debugging, this paper employs a plain and simple parallel debugging where each processer, and not necessarily test engineer, debug one single failing test case where all the other passing test cases are also utilized.…”
Section: Parallel and Sequential Debuggingmentioning
confidence: 99%
“…A typical practice is that each developer may focus on only fixing one single failing test case where the passing test cases are utilized. The idea of parallel is pretty popular is the area of big data and I/O [15] [16]. Although some techniques, including test cases clustering, are possibly adoptable in parallel debugging, this paper employs a plain and simple parallel debugging where each processer, and not necessarily test engineer, debug one single failing test case where all the other passing test cases are also utilized.…”
Section: Parallel and Sequential Debuggingmentioning
confidence: 99%
“…The occurrence of stragglers has significant effects on I/O performance of object storage systems. Since in HPC applications, clients normally need to synchronize after each I/O phase [3,17], the overall I/O performance will be determined by the longest one, which in turn is determined by the slowest object storage server. In general, the slow storage servers (i.e., stragglers) can be divided into two categories: long-term stragglers and short-term stragglers.…”
Section: Introductionmentioning
confidence: 99%
“…Estimating the time that applications spend on reading and writing data is a common task for their I/O performance tuning [19,20,10,31] and job scheduling [14] on HPC systems, and SQL query plan optimization [2,30] in database systems. One specific example is the Scientific Data Services Framework (SDS) [8,7,9] we are working on.…”
Section: Introductionmentioning
confidence: 99%