Proceedings 1999 IEEE International Conference on Computer Design: VLSI in Computers and Processors (Cat. No.99CB37040)
DOI: 10.1109/iccd.1999.808601
|View full text |Cite
|
Sign up to set email alerts
|

TriMedia CPU64 architecture

Abstract: We present a new VLIW core as a successor to the TriMedia TM1000. The processor is targeted for embedded use in media-processing devices like DTVs and set-top boxes. Intended as a core, its design must be supplemented with on-chip co-processors to obtain a cost-effective system. Good performance is obtained through a uniform 64-bit 5 issue-slot VLIW design, supporting subword parallelism with an extensive instruction set optimized with respect to media-processing. Multi-slot 'super-ops' allow powerful multi-ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 56 publications
(39 citation statements)
references
References 6 publications
(4 reference statements)
0
39
0
Order By: Relevance
“…Due to realization constraints, the RFU is likely to be located far away from the Register File (RF) in the floorplan of the TriMedia/CPU64. The immediate effect is that there will be large delays in transferring data between the RFU and RF, and the RFU will not benefit from bypassing capabilities of the RF [18]. Consequently, read and write back cycles have explicitely to be provided.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Due to realization constraints, the RFU is likely to be located far away from the Register File (RF) in the floorplan of the TriMedia/CPU64. The immediate effect is that there will be large delays in transferring data between the RFU and RF, and the RFU will not benefit from bypassing capabilities of the RF [18]. Consequently, read and write back cycles have explicitely to be provided.…”
Section: Discussionmentioning
confidence: 99%
“…Finally, eight 1-D IDCTs (two SIMD 1-D IDCTs) are computed having the results generated by the transposition as inputs. Following the described procedure, a complete 2-D IDCT including all overheads (mostly composed of load and store operations) can be performed in cycles [18].…”
Section: ¢ Idct Implementation On Standard Trimediamentioning
confidence: 99%
See 1 more Smart Citation
“…Experiment results on cache effectiveness for media applications in [19,20,21,43] show that a larger cache size hardly increases the performance in selected media applications due to streaming data nature and low temporal locality. However, data in media applications has higher spatial locality.…”
Section: Cache Memory Characteristics In Media Applicationsmentioning
confidence: 99%
“…The larger cache memory capacity, though beneficial in general, does not guarantee a higher performance for all the applications as they may not utilize the full cache efficiently. Moreover, a larger cache memory hardly increases the performance of multimedia apphcations due to the streaming nature and lack of temporal locality in media data [19,20,21].…”
Section: Motivation and Approachmentioning
confidence: 99%