2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02)
DOI: 10.1109/ccgrid.2002.1017117

Grid Datafarm Architecture for Petascale Data Intensive Computing

Abstract: The Grid Datafarm (Gfarm) architecture is designed for global petascale data-intensive computing. It provides a global parallel filesystem with online petascale storage, scalable I/O bandwidth, and scalable parallel processing, and it can exploit local I/O in a grid of clusters with tens of thousands of nodes. Gfarm parallel I/O APIs and commands provide a single filesystem image and manipulate filesystem metadata consistently. Fault tolerance and load balancing are automatically managed by file duplication o…

Cited by 108 publications (67 citation statements)
References 5 publications (5 reference statements)
“…If parallelism between heterogeneous networks and elements is used, it is possible to asynchronously send the suitable block size BS C S to each resource. The block size of each element is obtained independently by means of (6). In this sense, faster storage elements receive more blocks according to their transfer rate.…”
Section: Decision-making For File Creating Requests (mentioning)
confidence: 99%
“…The broker not only performs the decision-making but is also essential for managing these systems. In this sense, GAS has been conceived as a brokering-based solution and can therefore be used with minor changes in conjunction with any data access service, such as Storage Resource Manager (SRM) [5], Grid Datafarm (Gfarm) [6], MAPFS-Grid [7], etc. designed to provide self-management features to the system.…”
Section: GAS Architecture (mentioning)
confidence: 99%
“…A very large distributed storage space is thus made available to applications that usually use file storage, with no need for modifications. This approach has been taken by a few projects like GFarm [34], GridNFS [13], GPFS [29], XtreemFS [14], etc. Implementing transparent access at a global scale naturally leads however to a number of challenges related to scalability and performance, as the file system is put under pressure by a very large number of concurrent accesses.…”
Section: Related Work (mentioning)
confidence: 99%
“…As such, Plan 9 nodes interoperate only with one other. In the Grid context, the construction of a single, uniform and hierarchical file namespace has been studied in [34] and [35].…”
Section: Related Work (mentioning)
confidence: 99%