2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines 2013
DOI: 10.1109/fccm.2013.35
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Large Integer Squarers on FPGA

Abstract: This paper presents an optimised high throughput architecture for integer squaring on FPGAs. The approach reduces the number of DSP blocks required compared to a standard multiplier. Previous work has proposed the tiling method for double precision squaring, using the least number of DSP blocks so far. However that approach incurs a large overhead in terms of look-up table (LUT) consumption and has a complex and irregular structure that is not suitable for higher word size. The architecture proposed in this pa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…Given a number M [0 : −n+1] of n bits, and making a = M [0 : −n/2+1] and b = M [−n/2 : −n + 1], each partial product has 1/4 of the complexity of the full product, and a reduction in circuit area of nearly 25% is achieved. For large values of n, breaking up M in 3 or more chunks allows for additional gains as shown for Xilinx FPGAs in [14].…”
Section: Squaringmentioning
confidence: 99%
“…Given a number M [0 : −n+1] of n bits, and making a = M [0 : −n/2+1] and b = M [−n/2 : −n + 1], each partial product has 1/4 of the complexity of the full product, and a reduction in circuit area of nearly 25% is achieved. For large values of n, breaking up M in 3 or more chunks allows for additional gains as shown for Xilinx FPGAs in [14].…”
Section: Squaringmentioning
confidence: 99%
“…Various previous works cover the efficient realization of squarers on FPGAs by improving the utilization of the DSP units and handling the weighting between DSP and LUT resources [13]- [15]. In the work of Lee and Burgess [14], a combination of efficient 2×k multipliers together with a single DSP block is used to reduce its complexity.…”
Section: Introductionmentioning
confidence: 99%
“…In the work of Lee and Burgess [14], a combination of efficient 2×k multipliers together with a single DSP block is used to reduce its complexity. The works in [13] and [15] address large squarer design by using modifications of the Karatsuba-Ofman algorithm to save DSPs.…”
Section: Introductionmentioning
confidence: 99%