2007
DOI: 10.1145/1233307.1233308
|View full text |Cite
|
Sign up to set email alerts
|

The WaveScalar architecture

Abstract: Silicon technology will continue to provide an exponential increase in the availability of raw transistors. Effectively translating this resource into application performance, however, is an open challenge that conventional superscalar designs will not be able to meet. We present WaveScalar as a scalable alternative to conventional designs. WaveScalar is a dataflow instruction set and execution model designed for scalable, low-complexity/high-performance processors. Unlike previous dataflow machines, WaveScala… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
50
0
48

Year Published

2008
2008
2021
2021

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 119 publications
(104 citation statements)
references
References 56 publications
0
50
0
48
Order By: Relevance
“…In fact, parallelism breaks the computation into finer-grain chunks on separate regions. This reduces global memory accesses by leveraging local storage in regions and optional local memories, which is analogous to spatial computing approaches in classical computing [14,45,46].…”
Section: The Multi-simd Architectural Modelmentioning
confidence: 99%
“…In fact, parallelism breaks the computation into finer-grain chunks on separate regions. This reduces global memory accesses by leveraging local storage in regions and optional local memories, which is analogous to spatial computing approaches in classical computing [14,45,46].…”
Section: The Multi-simd Architectural Modelmentioning
confidence: 99%
“…Tiled architectures, such as Raw [Taylor et al 2004], TRIPS [Sankaralingam et al 2003], and WaveScalar [Swanson et al 2007], are a common approach to improving scalability because they reduce wire delay. Scalable CoDA systems also use a tiled architecture for this reason and to distribute coprocessors among multiple memory and host interfaces.…”
Section: Related Workmentioning
confidence: 99%
“…The dataflow model [7], is one of the major contenders in meeting the above criteria. Unfortunately this model is relatively inefficient and also has difficulty in capturing the imperative programming style.…”
Section: Introductionmentioning
confidence: 99%
“…Unfortunately this model is relatively inefficient and also has difficulty in capturing the imperative programming style. The former is illustrated in [7], which describes a loop summing its own index that requires seven instructions in its body, six instructions of overhead for a single operation. For comparison, the sequential model has an overhead of two instructions to implement the same loop and more to the point, the micro-architecture described here has a zero-instruction overhead.…”
Section: Introductionmentioning
confidence: 99%