2009 International Conference on Field Programmable Logic and Applications 2009
DOI: 10.1109/fpl.2009.5272296
|View full text |Cite
|
Sign up to set email alerts
|

Large multipliers with fewer DSP blocks

Abstract: Recent computing-oriented FPGAs feature DSP blocks including small embedded multipliers. A large integer multiplier, for instance for a double-precision floating-point multiplier, consumes many of these DSP blocks. This article studies three non-standard implementation techniques of large multipliers: the Karatsuba-Ofman algorithm, nonstandard multiplier tiling, and specialized squarers. They allow for large multipliers working at the peak frequency of the DSP blocks while reducing the DSP block usage. Their o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
83
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 65 publications
(83 citation statements)
references
References 2 publications
0
83
0
Order By: Relevance
“…In our previous work [28], an efficient squarer was proposed which consumes up to 50% less hardware resources than an equivalent width multiplier. It can use 1 fewer DSP block than the method in [29] at a cost of only 127 additional LUTs. The architecture for a 52-bit squarer is shown in Figure 4.…”
Section: Fpga Computationmentioning
confidence: 99%
See 1 more Smart Citation
“…In our previous work [28], an efficient squarer was proposed which consumes up to 50% less hardware resources than an equivalent width multiplier. It can use 1 fewer DSP block than the method in [29] at a cost of only 127 additional LUTs. The architecture for a 52-bit squarer is shown in Figure 4.…”
Section: Fpga Computationmentioning
confidence: 99%
“…Note that in order to achieve maximum DSP block frequency, it is necessary to add an additional register stage any time a DSP block output is passed to LUTs (for implemented small adders for example). This has not been taken into account in [29] and [13], but is done by default in this work.…”
Section: Fpga Computationmentioning
confidence: 99%
“…If the input size of an operand is not an exact multiple of the inputs (m, n) of the embedded multiplier, the last digit obtained by the decomposition is zero-padded to match the nearest multiple. Exploiting the leading zeros and approaches similar to the non-standard tiling [18] is a task considered for future.…”
Section: Operand Decompositionmentioning
confidence: 99%
“…The authors in [18] explore three alternative types of large integer multiplier generation for FPGAs: Karatsuba-Ofman algorithm, non-standard tiling (an alternate, less regular form of divide and conquer) and specialized squarers. The Karatsuba-Ofman algorithm trades multiplications for additions by rearranging the creation of partial products and thereby reducing the number of multipliers/DSP blocks required.…”
Section: Related Workmentioning
confidence: 99%
“…However, the performance of the multiplier is suitable only when operand sizes are small. In this regard, Florent de Dinechin and Bogdan Pasca [3] have also presented their work in which, they used fewer DSP blocks to realize large multipliers. They demonstrated better performance, in terms of saving precious DSP blocks and maintaining the operating frequency.…”
Section: Introductionmentioning
confidence: 99%