Proceedings of the 3rd IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis 2005
DOI: 10.1145/1084834.1084890
|View full text |Cite
|
Sign up to set email alerts
|

High-level synthesis for large bit-width multipliers on FPGAs

Abstract: In this paper, we present the analysis, design and implementation of an estimator to realize large bit width unsigned integer multiplier units. Larger multiplier units are required for cryptography and error correction circuits for more secure and reliable transmissions over highly insecure and/or noisy channels in networking and multimedia applications. The design space for these circuits is very large when integer multiplication on large operands is carried out hierarchically. In this paper, we explore autom… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2007
2007
2019
2019

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(20 citation statements)
references
References 20 publications
(14 reference statements)
0
20
0
Order By: Relevance
“…For a fair comparison, we also implemented the 256-bit point addition in the same Virtex-II technology. From Table 5 we can see that the speedup in our case with respect to the Beowulf implementation is 11.2 and the speedup with respect to the FPGA-embedded multiplier of [9] is 1.4. Note also that they would require a two-chip Virtex-II implementation, for point addition and point doubling, respectively, whereas our pipeline folding results in one FPGA chip design.…”
Section: Resultsmentioning
confidence: 80%
See 2 more Smart Citations
“…For a fair comparison, we also implemented the 256-bit point addition in the same Virtex-II technology. From Table 5 we can see that the speedup in our case with respect to the Beowulf implementation is 11.2 and the speedup with respect to the FPGA-embedded multiplier of [9] is 1.4. Note also that they would require a two-chip Virtex-II implementation, for point addition and point doubling, respectively, whereas our pipeline folding results in one FPGA chip design.…”
Section: Resultsmentioning
confidence: 80%
“…Virtex-4 delay of scalar point multiplication based on Virtex-II Pro FPGA. We compared our 256-bit point addition results to a multiprocessor implementation based on a 32-node Beowulf cluster [9]. The results shown in Table 5 also include their Virtex-II implementation which uses 144 18-bit embedded multipliers.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…References [7]- [13] have more detailed information about large width multipliers, further references are mentioned as we go.…”
Section: Architectures Consideredmentioning
confidence: 99%
“…While increasing m, implementations become very time and resource consuming. Most of the known architectures concern the acceleration of the multiplication process by modifying the elliptic equations by changing the Z coordinate term [5], or by multiplication scalability [6], or by using many serial and parallel Arithmetic units [7], or using High parallel Karatsuba Multiplier [8], those based on the MassyOmura multipliers, or the work based on a hybrid multipliers approach, also some parallel approach approaches, or the new word level structure, or through the systolic architecture, or by using the half and add method, or by parallelizing both the add and double Montgomery algorithms [9].…”
Section: Hardware Designmentioning
confidence: 99%