2008
DOI: 10.1177/1094342007085019

An Evaluation of the Oak Ridge National Laboratory Cray XT3

Abstract: In 2005, Oak Ridge National Laboratory (ORNL) received delivery of a 5294-processor Cray XT3. The XT3 is Cray's third-generation massively parallel processing system. The ORNL system uses a single-processor node built around the AMD Opteron and uses a custom chip, called SeaStar, for interprocessor communication. The system uses a lightweight operating system called Catamount on its compute nodes. This paper provides a performance evaluation of the Cray XT3, including measurements for micro-benchmark, kernel, an…

Citations: cited by 18 publications (10 citation statements)
References: 25 publications
“…The average improvement is 1.1%, 9.0%, 15.1%, and 17.2% for two, four, eight, and sixteen threads, respectively. Therefore, taking advantage of Itanium2 caches on FINISTERRAE is critical, because accesses to memory incur a high penalty in comparison with other systems [11]. Table 3 shows the SpMV performance without optimizations and when the optimizations detailed previously (data and thread allocation, and locality improvement) are applied together.…”
Section: Data Reordering; citation type: mentioning
confidence: 97%
“…In current high performance architectures, the ratio of F/C is in the range of 0.5-1. For example, in Cray XT3 (Alam et al., 2008), F = 4.8 × 10^9, and the links have a peak bandwidth of 7.6 GB/s. With small cache blocks and miss rate in the neighborhood of 10% or less (due to the fact that programmers are going to target and distribute their applications for maximum locality, thus most accesses on well-behaved applications are going to fall in cache), the resulting T/R is in the range of 0.05-1.…”
Section: Dataset Details; citation type: mentioning
confidence: 99%
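The F/C figure quoted in the statement above can be checked with a short calculation. This is only a sketch under stated assumptions: the excerpt does not define F or C, so the code assumes F is a peak floating-point rate in operations per second and C is the quoted peak link bandwidth in bytes per second. The function name `flops_per_byte` is hypothetical and chosen for illustration.

```python
# Assumption: F is the peak floating-point rate (operations per second)
# and C is the peak link bandwidth (bytes per second). The quoted excerpt
# does not define either symbol, so this reading is a guess.
F = 4.8e9   # value quoted for the Cray XT3
C = 7.6e9   # 7.6 GB/s peak link bandwidth, taken as 7.6e9 bytes/s


def flops_per_byte(f: float, c: float) -> float:
    """Return the ratio of compute rate to communication bandwidth."""
    return f / c


ratio = flops_per_byte(F, C)
print(f"F/C = {ratio:.2f}")
```

Under these assumptions the ratio falls inside the 0.5-1 range the statement gives for current high-performance architectures. A different unit for C (for example, words instead of bytes) would change the result.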
“…Cray provides a Message Passing Interface (MPI) communication based on MPICH version 1.2 that uses Portals for data transfer. Details of the system can be found in [12].…”
Section: 21; citation type: mentioning
confidence: 99%