1999
DOI: 10.1109/4.799870
|View full text |Cite
|
Sign up to set email alerts
|

A microprocessor with a 128-bit CPU, ten floating-point MAC's, four floating-point dividers, and an MPEG-2 decoder

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
10
0

Year Published

2001
2001
2020
2020

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(11 citation statements)
references
References 1 publication
0
10
0
Order By: Relevance
“…Until recently, few of these architectural concepts made it into mainstream processor designs, but the increasingly stringent power/performance requirements for embedded systems have resulted in a number of recent implementations that have adopted these concepts. Chips like the Sony Emotion Engine [20,23,29] and Intel's MXP5800 both achieved high performance at low power by adopting three levels (registers, local memory, external DRAM) of softwaremanaged memory. More recently, the STI Cell processor has adopted a similar approach where data movement between Figure 1: Overview of the Cell processor these three address spaces is explicitly controlled by the application.…”
Section: Related Workmentioning
confidence: 99%
“…Until recently, few of these architectural concepts made it into mainstream processor designs, but the increasingly stringent power/performance requirements for embedded systems have resulted in a number of recent implementations that have adopted these concepts. Chips like the Sony Emotion Engine [20,23,29] and Intel's MXP5800 both achieved high performance at low power by adopting three levels (registers, local memory, external DRAM) of softwaremanaged memory. More recently, the STI Cell processor has adopted a similar approach where data movement between Figure 1: Overview of the Cell processor these three address spaces is explicitly controlled by the application.…”
Section: Related Workmentioning
confidence: 99%
“…Thus, not only contention and access latency are reduced but also the data transfer time can be hidden. A clear example is the processor of the PlayStation 2 videogaming system [9], composed by a main general purpose core, two vector units and a scratch-pad memory in which data are moved through a programmable DMA controller. The Cell processor [7] from Sony, IBM and Toshiba is the ideal successor of that design.…”
Section: Related Workmentioning
confidence: 99%
“…The architecture space of such systems varies greatly and typically includes a mid/high-performance scalar processor executing the OS, I/O and high-level data flow scheduling, complemented by a number of programmable or hardwired custom accelerators tuned to accelerate certain core functions of the target application [2]. These accelerators occasionally are custom VLIW engines as the VLIW architectural paradigm [3] seems to be the most promising in delivering very high single-processor performance, particularly on data-parallel inner loops; interestingly enough, such processors seem to be finding niche markets in behavioral (ESL) synthesis as a core component of the underlying VLSI platform [4].…”
Section: Introductionmentioning
confidence: 99%