Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing 2013
DOI: 10.1145/2493123.2462909
|View full text |Cite
|
Sign up to set email alerts
|

I/O acceleration with pattern detection

Abstract: The I/O bottleneck in high-performance computing is becoming worse as application data continues to grow. In this work, we explore how patterns of I/O within these applications can significantly affect the effectiveness of the underlying storage systems and how these same patterns can be utilized to improve many aspects of the I/O stack and mitigate the I/O bottleneck. We offer three main contributions in this paper. First, we develop and evaluate algorithms by which I/O patterns can be efficiently discovered … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 48 publications
(14 citation statements)
references
References 31 publications
0
14
0
Order By: Relevance
“…Closer to our approach is the work by He et al [7], who propose an approach to spatial I/O pattern detection to improve metadata indexing in PLFS. Their approach considers a sequence of (offset,size) access parameters and tries to find repetitive patterns in the differences (delta) between consecutive accesses, using a method inspired by the LZ77 sliding window algorithm.…”
Section: ) Spatial Predictionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Closer to our approach is the work by He et al [7], who propose an approach to spatial I/O pattern detection to improve metadata indexing in PLFS. Their approach considers a sequence of (offset,size) access parameters and tries to find repetitive patterns in the differences (delta) between consecutive accesses, using a method inspired by the LZ77 sliding window algorithm.…”
Section: ) Spatial Predictionsmentioning
confidence: 99%
“…The effectiveness of such techniques strongly depends on a certain level of knowledge of the I/O access patterns: prefetching and caching indeed require the location of future accesses (i.e., spatial behavior), while I/O scheduling leverages estimations of I/O requests interarrival time (i.e., temporal behavior). The key challenges inherent in these techniques include the proper comprehension and exploitation of the application's I/O behavior within the I/O stack itself [7], [5]. Hence, modeling and predicting the applications I/O behavior are of utmost importance.…”
Section: Introductionmentioning
confidence: 99%
“…He et al [33] propose an approach to improve read operations with the PLFS library. Information from trace files -generated while writing data -is used to reconstruct the high-level data structures used by the application (through MPI-IO or HDF5).…”
Section: Hybrid Runtime + Postmortem Approachesmentioning
confidence: 99%
“…Another recent study introduced methods to automatically identify I/O patterns at the block level and utilized them to manage the metadata indexing and support data prefetching in physics and geoscience applications [38]. Statistical analysis of NetCDF scientific datasets was successfully explored over the Hadoop platform developed for big data applications [10].…”
Section: Big Data + Hpc Integrated Sw Stackmentioning
confidence: 99%