2019
DOI: 10.1109/tvlsi.2019.2897508
|View full text |Cite
|
Sign up to set email alerts
|

LordCore: Energy-Efficient OpenCL-Programmable Software-Defined Radio Coprocessor

Abstract: This paper proposes a single instruction multiple data (SIMD) processor, which is programmed with high-level OpenCL language. The low-power processor is customized for executing multiple-input-multiple-output (MIMO) detection algorithms at a high performance while consuming very little power making it suitable for software-defined radio (SDR) applications. The novel combination of SIMD operations on a transport programmed multicore datapath allows saving power on both the execution front end and the back end, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 48 publications
(65 reference statements)
0
4
0
Order By: Relevance
“…Reducing both the amount of register file ports and interconnect connectivity is important with wide processor architectures, as otherwise they quickly hit the ILP complexity wall [25] due to the amount of connectivity on the processor datapath. TTAs' modular structure allows heavy pruning of the interconnection network due to programmer-directed operand transportations, which enables easy scaling from high-performance [26] to small energy-efficient TTA designs [27]. Figure 2 describes an example of a TTA instruction template for an architecture with five busses.…”
Section: Transport Triggered Architecturesmentioning
confidence: 99%
“…Reducing both the amount of register file ports and interconnect connectivity is important with wide processor architectures, as otherwise they quickly hit the ILP complexity wall [25] due to the amount of connectivity on the processor datapath. TTAs' modular structure allows heavy pruning of the interconnection network due to programmer-directed operand transportations, which enables easy scaling from high-performance [26] to small energy-efficient TTA designs [27]. Figure 2 describes an example of a TTA instruction template for an architecture with five busses.…”
Section: Transport Triggered Architecturesmentioning
confidence: 99%
“…The proposed architecture shows a 14 × reduction in cycles, compared to a Texas Instruments C66x. Aside from that, there are some reports of TTAs being used for softwaredefined radio (SDR) implementation of mobile communication standards [12]- [14]. The TTA described in [13], which is optimized for decoding of polar codes, is reported to outperform state-of-the-art ASIP implementations fivefold in throughput while consuming an order of magnitude less energy.…”
Section: Related Workmentioning
confidence: 99%
“…They find that their TTA implementation achieves a 1.85 to 10.73 better mW/MHz rating than comparable architectures. The "LordCore", a TTA processor for the decoding of Turbo-Codes [14] is said to outperform GPU-based implementations by three orders of magnitude in the performance/power ratio. Compared to fixed-function implementations of the algorithm, the TTA implementation requires more power, but "[the] results show that the energy Fig.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…For programmability and reconfiguration purposes, field programmable gate arrays (FPGAs) are the platform of choice for implementing most SDR concepts (Balakrishnan et al, 2019). SDR processors are challenging to design because multiple-input multipleoutput (MIMO) orthogonal frequency division multiplexing (OFDM) links require tremendous processing power (Kultala et al, 2019).…”
Section: Introductionmentioning
confidence: 99%