2008 IEEE International Conference on Computer Design 2008
DOI: 10.1109/iccd.2008.4751880
|View full text |Cite
|
Sign up to set email alerts
|

A family of scalable FFT architectures and an implementation of 1024-point radix-2 FFT for real-time communications

Abstract: The paper presents a family of architectures for FFT implementation based on the decomposition of the perfect shuffle permutation, which can be designed with variable number of processing elements. This provides designers with a trade-off choice of speed vs. complexity (cost and area.). A detailed case study is provided on the implementation of 1024-point FFT with 2 processing elements using 45nm process technology, including area, timing, power and place-and-route results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2009
2009
2017
2017

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 19 publications
0
4
0
Order By: Relevance
“…Four single-port RAM memories are used in [32] combined with one radix-2 processing element. A scalable pipelined architecture is presented in [26] where two or four processing elements can be used with perfect shuffle permutation and data reordering. High energy-efficiency is achieved by using modern technology and low supply voltage.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Four single-port RAM memories are used in [32] combined with one radix-2 processing element. A scalable pipelined architecture is presented in [26] where two or four processing elements can be used with perfect shuffle permutation and data reordering. High energy-efficiency is achieved by using modern technology and low supply voltage.…”
Section: Methodsmentioning
confidence: 99%
“…FFT implementations can be divided to two categories: memory based, e.g., [25,31] or pipelines, e.g., [11,26,30]. Pipelined FFTs are usually used in short FFTs since as the size of the FFT increases, more intermediate storage is needed and memory is more efficient storage for large amount of data than registers and delay elements used in pipelines.…”
mentioning
confidence: 99%
“…This can be applied sequentially as follows: Reordering operators can operate on two data indices at a time instead of three as shown in (7) [16] for further decomposition detail.…”
Section: Decompositions Of Perfect Shuffle (Unshuffle) Reorderingmentioning
confidence: 99%
“…In some cases the length of the processed sequence is fixed but there exist also very flexible, variable length, and memory-based FFT implementations [GRH11]. A more detailed characterization of FFT architectures can be found in [SSHA08]. The implementation of the NTT on graphic cards is described in [Eme09].…”
Section: Previous Work Unrelated To Lattice-based Cryptographymentioning
confidence: 99%