2008
DOI: 10.1093/comjnl/bxm099

A Review of SIMD Multimedia Extensions and their Usage in Scientific and Engineering Applications

Cited by 39 publications (21 citation statements)
References 33 publications
“…Existing solutions for multimedia and matrix operations mostly focus on 1D [15] and 2D [3] arrangements of processing elements [16]. In this section, we use matrix multiplication to give a perspective of differences between 1D and 2D arrangement of PEs for matrix computations.…”
Section: A 1d Vs 2d Architectures; citation type: mentioning
confidence: 99%
“…Three main limitations of conventional 1D vector architectures are known to be complexity of the central register file, implementation difficulties of precise exception handling, and expensive on-chip memory [20]. A detailed review of SIMD multimedia extensions and their bottlenecks are presented in [15], [39]. Associated costs are amplified by the fact that in each step a complete vector has to be transferred through multiple ports of a register file, wide wires, and complex point-to-point interconnects such as crossbars.…”
Section: B Related Work; citation type: mentioning
confidence: 99%
“…All major vendors support vector instructions and the trend is pushing them to become wider and more powerful [1]. SIMD instruction set extensions are quite common today in both high performance and embedded microprocessors [2]. However, writing code that makes efficient use of these units and leads to platform-specific implementations is rather difficult [3].…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…For instance, a set of single-instruction multiple-data (SIMD) registers have been employed to parallelize data operations within the processor [29]. In 1999, Intel's Pentium III processor family first introduced the streaming SIMD extensions (SSE) instructions (Intel Corp; Santa Clara, California).…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…In 1999, Intel's Pentium III processor family first introduced the streaming SIMD extensions (SSE) instructions (Intel Corp; Santa Clara, California). SSE expand the SIMD execution model by a new set of 128-bit registers to provide the ability to perform SIMD operations on packed and scalar single-precision floating-point values [29][30]. SSE2 was then introduced in 2001 along with the Pentium IV and Intel Xeon processors [30] to enable more computations in parallel.…”
Section: Introduction; citation type: mentioning
confidence: 99%
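
Illustrative note (not taken from the reviewed paper or from any citing publication): the statement above describes SSE as operating on packed single-precision values held in 128-bit registers. A minimal sketch of what that can look like in C is given below, assuming a compiler that ships `<xmmintrin.h>` and defines `__SSE__` on SSE-capable targets. The function name `add_f32` is hypothetical, and a scalar loop handles leftover elements and non-SSE builds.

```c
#include <stddef.h>
#if defined(__SSE__)
#include <xmmintrin.h>  /* SSE intrinsics: __m128, _mm_add_ps, ... */
#endif

/* Hypothetical helper (an assumption for illustration):
 * computes out[i] = a[i] + b[i] for n single-precision floats. */
void add_f32(const float *a, const float *b, float *out, size_t n)
{
    size_t i = 0;
#if defined(__SSE__)
    /* One 128-bit register holds four packed 32-bit floats,
     * so each iteration adds four elements at once. */
    for (; i + 4 <= n; i += 4) {
        __m128 va = _mm_loadu_ps(a + i);   /* unaligned load of 4 floats */
        __m128 vb = _mm_loadu_ps(b + i);
        _mm_storeu_ps(out + i, _mm_add_ps(va, vb));
    }
#endif
    /* Scalar tail: remaining elements, or all of them without SSE. */
    for (; i < n; ++i)
        out[i] = a[i] + b[i];
}
```

Unaligned loads and stores are used so the sketch makes no assumption about how the caller allocated the arrays; aligned variants exist but require 16-byte-aligned pointers.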