Proceedings of the 8th Parallel Data Storage Workshop 2013
DOI: 10.1145/2538542.2538564
|View full text |Cite
|
Sign up to set email alerts
|

Structuring PLFS for extensibility

Abstract: The Parallel Log Structured Filesystem (PLFS) [5] was designed to transparently transform highly concurrent, massive high-performance computing (HPC) N-to-1 checkpoint workloads into N-to-N workloads to avoid single-file performance bottlenecks in typical HPC distributed filesystems. PLFS has produced speedups of 2-150X for N-1 workloads at Los Alamos National Lab. Having successfully improved N-1 performance, we have restructured PLFS for extensibility so that it can be applied to more workloads and storage s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 18 publications
(16 reference statements)
0
6
0
Order By: Relevance
“…In order to achieve a maximum amount of local data reads, we employ the standard max-flow algorithm, Ford-Fulkerson [7], to compute the largest flow from s to t. The algorithm will iterate many times. In each iteration it increases the number of tasks/files assigned to processes.…”
Section: B Optimization Of Parallel Single-data Accessmentioning
confidence: 99%
See 3 more Smart Citations
“…In order to achieve a maximum amount of local data reads, we employ the standard max-flow algorithm, Ford-Fulkerson [7], to compute the largest flow from s to t. The algorithm will iterate many times. In each iteration it increases the number of tasks/files assigned to processes.…”
Section: B Optimization Of Parallel Single-data Accessmentioning
confidence: 99%
“…In each iteration it increases the number of tasks/files assigned to processes. With the use of flow-augmenting paths [7], if a task t has been assigned to process i, but the overall size the graph's maximum matching could be increased by matching t with another process j, the assignment of t to i will be canceled and t is reassigned to j. Such a cancellation policy enables the assignments of processes on tasks to be optimal.…”
Section: B Optimization Of Parallel Single-data Accessmentioning
confidence: 99%
See 2 more Smart Citations
“…Many researches have been proposed to use the Hadoop system for parallel data processing. Gibson [3] and Sun [5] propose methods to write parallel data into HDFS and achieve high I/O performance. MRAP [8] is proposed to reconstruct scientific data according to data access patterns to assist data processing using the Hadoop system.…”
Section: Related Workmentioning
confidence: 99%