Proceedings of the 20th Annual International Conference on Supercomputing 2006
DOI: 10.1145/1183401.1183419
|View full text |Cite
|
Sign up to set email alerts
|

Large files, small writes, and pNFS

Abstract: Workload characterization studies highlight the prevalence of small and sequential data requests in scientific applications. Parallel file systems excel at large data transfers but sometimes at the expense of small I/O performance. pNFS is an NFSv4.1 high-performance enhancement that provides direct storage access to parallel file systems while preserving NFSv4 operating system and hardware platform independence. This paper demonstrates that distributed file systems can increase write throughput to parallel da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2007
2007
2013
2013

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 21 publications
(12 citation statements)
references
References 21 publications
0
12
0
Order By: Relevance
“…In the future, we plan to apply our response time-based replica management strategy in HDFS [3], PVFS [11], pNFS [12], Gpfs [13], and LusterFS [14].…”
Section: Discussionmentioning
confidence: 99%
“…In the future, we plan to apply our response time-based replica management strategy in HDFS [3], PVFS [11], pNFS [12], Gpfs [13], and LusterFS [14].…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, we implement I/O buffering in BMF to batch small read and write operations into large ones before accessing the file system. This buffering strategy avoids flurries of small reads and writes that can degrade the I/O performance of large parallel file systems [20], [21].…”
Section: Parallelizing the Fastbit-based Mass Querymentioning
confidence: 99%
“…The traditional dynamic power management strategy is an efficient energy conservation technique for large idle time periods, which make it worthwhile to spin down disks when they are sitting idle. However, small and sequential data requests in modern scientific applications are very prevalent [7], making it less likely to observe large idle time intervals among requests. Moreover, small writes cause not only an energy consumption problem but also an I/O performance problem [1].…”
Section: Related Workmentioning
confidence: 99%
“…In the past decade, large-scale storage systems have been developed to achieve high I/O performance and large storage capacity for a wide variety of data-intensive applications [13] [6][10] [7]. Much attention has been paid to the issues of performance and security in storage systems [21][19] [2].…”
Section: Introductionmentioning
confidence: 99%