2005
DOI: 10.1147/rd.492.0255
|View full text |Cite
|
Sign up to set email alerts
|

Blue Gene/L compute chip: Memory and Ethernet subsystem

Abstract: The Blue Genet/L compute chip is a dual-processor system-on-a-chip capable of delivering an arithmetic peak performance of 5.6 gigaflops. To match the memory speed to the high compute performance, the system implements an aggressive three-level on-chip cache hierarchy. The implemented hierarchy offers high bandwidth and integrated prefetching on cache hierarchy levels 2 and 3 (L2 and L3) to reduce memory access time. A Gigabit Ethernet interface driven by direct memory access (DMA) is integrated in the cache h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2006
2006
2012
2012

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(13 citation statements)
references
References 5 publications
0
13
0
Order By: Relevance
“…Although not found in current high-performance server systems, CPUs with integrated Ethernet NICs appear in other environments [8,33], are rumored to be present in some upcoming servers [13], and have been shown to provide substantial performance improvements on server workloads [5]. (See Section 2.1 for further discussion.)…”
Section: Introductionmentioning
confidence: 99%
“…Although not found in current high-performance server systems, CPUs with integrated Ethernet NICs appear in other environments [8,33], are rumored to be present in some upcoming servers [13], and have been shown to provide substantial performance improvements on server workloads [5]. (See Section 2.1 for further discussion.)…”
Section: Introductionmentioning
confidence: 99%
“…The ultimate goal, of course, is to satisfy most of the references from L1. Keeping the L1 size fixed and assuming it is not fully associative (although there exist some examples of fully associative L1s in real machines [24]), we try to decrease conflict misses by reducing contention among L1 sets. We do so by keeping some blocks out of the L1.…”
Section: Adaptive Block Placement (Abp)mentioning
confidence: 99%
“…The experimental data collection for this study is undertaken on x86-based standalone systems, Cray XT3 [9], Cray XT4 [10], IBM Blue Gene/L [15], and Blue Gene/P [4] systems. The x86-based standalone systems include an eight socket, dual-core AMD Opteron 8216 [2], an eight socket, quad-core AMD Opteron 8350 [1], and an Intel quad-core Clovertown system [18].…”
Section: Testing Environment Hardwarementioning
confidence: 99%