2013 9th International Wireless Communications and Mobile Computing Conference (IWCMC) 2013
DOI: 10.1109/iwcmc.2013.6583710
|View full text |Cite
|
Sign up to set email alerts
|

Turbo decoding on tailored OpenCL processor

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…The Fast configuration, introduced in [13], provides clustered TTA-based processors that can reach high-frequency with large potential of parallel computing. We assume that a chip composed of Fast TTA processors can reach 1GHz using 40nm CMOS technology such as demonstrated in previous work [18].…”
Section: Processor Architecturementioning
confidence: 99%
See 1 more Smart Citation
“…The Fast configuration, introduced in [13], provides clustered TTA-based processors that can reach high-frequency with large potential of parallel computing. We assume that a chip composed of Fast TTA processors can reach 1GHz using 40nm CMOS technology such as demonstrated in previous work [18].…”
Section: Processor Architecturementioning
confidence: 99%
“…We assume that such platforms can be clocked at 1GHz. Indeed, previous work has shown that the processor cores can already reach 1GHz using 40nm technology [18]. Thus, the results are obtained from a simulated execution, but let us point out that successful implementations of the MPEG-4 Visual decoder has already been synthesized on two different FPGA boards clocked at 100MHz: Altera Stratix III and Xilinx Virtex 6.…”
Section: Analysis Of Performancementioning
confidence: 99%
“…In the past, vector TTA processors used interleaved memory, which increases latency and hardware complexity [4,13]. We have found that many signal processing applications can be written to use linear memory access, so the wide 256-bit data words of our simpler memory hierarchy are a fast and cost-effective solution.…”
Section: Symmetric Clusters For Parallel Codementioning
confidence: 99%
“…In past designs, it was assumed that a PCI Express bus could transport data to an applications processor when sequential execution was necessary [4]. In our previous work, we have found that transferring data between parallel and sequential processors was a significant bottleneck, particularly in heterogeneous CPU and GPU systems [6,14].…”
Section: Main Processor For Sequential Codementioning
confidence: 99%
See 1 more Smart Citation