Adding a vector unit to a superscalar processor

Quintana, Francisca; Corbal, Jesús; Espasa, Roger; Valero, Mateo

doi:10.1145/305138.305148

Cited by 36 publications

(37 citation statements)

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Parameters are similar to those found in some recent microprocessors with multimedia extensions like PowerPC970. For VMMX versions a vector cache was used [22]. The vector cache is a twobank interleaved cache targeted at accessing stride-one vector requests by loading two whole cache lines (one per bank) instead of individually loading the vector elements.…”

Section: Memory Hierarchy Modelmentioning

confidence: 99%

On the Scalability of 1- and 2-Dimensional SIMD Extensions for Multimedia Applications

Sánchez

Álvarez

Salamf

et al. 2005

IEEE International Symposium on Performance Analysis of Systems and Software, 2005. ISPASS 2005.

Self Cite

View full text Add to dashboard Cite

Section: Memory Hierarchy Modelmentioning

confidence: 99%

On the Scalability of 1- and 2-Dimensional SIMD Extensions for Multimedia Applications

Sánchez

Álvarez

Salamf

et al. 2005

IEEE International Symposium on Performance Analysis of Systems and Software, 2005. ISPASS 2005.

Self Cite

View full text Add to dashboard Cite

“…The correctness of the output was verified to ensure no visually perceptible losses in accuracy. Finally, we modified our Jinks simulator [10] to be able to filter the input instruction stream provided by ATOM [15] and correctly simulate the emulated instructions.…”

Section: Emulation Libraries and Code Generationmentioning

confidence: 99%

“…In [10], we studied the design of cost-effective cache hierarchies to leverage high-bandwidth for out-of-order vector processors. In the same way as conventional vector instructions, MOM memory patterns have the potential to allow a smart exploitation of the spatial locality intrinsic in multimedia codes.…”

Section: Cache Hierarchymentioning

confidence: 99%

“…The vector cache was proposed in [10] and heavily borrows from the ideas introduced in [20]. As it can be seen in figure 6, the vector cache is targeted at accessing stride-one vector requests by loading two whole cache lines (one per interleaved bank) instead of individually loading the vector elements.…”

Section: Cache Hierarchymentioning

confidence: 99%

“…Recent works dealing with the DLP exploitation of multimedia applications can be divided into two different groups: those evaluating the performance of conventional vector ISAs on multimedia codes [9,10], and those evaluating the performance of current multimedia extensions [11,12].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations