High-level synthesis for large bit-width multipliers on FPGAs

Quan, Gang; Davis, Joan M.; Devarkal, Siddhaveerasharan; Buell, Duncan A.

doi:10.1145/1084834.1084890

Cited by 23 publications

(20 citation statements)

References 20 publications

(14 reference statements)


“…For a fair comparison, we also implemented the 256-bit point addition in the same Virtex-II technology. From Table 5 we can see that the speedup in our case with respect to the Beowulf implementation is 11.2 and the speedup with respect to the FPGA-embedded multiplier of [9] is 1.4. Note also that they would require a two-chip Virtex-II implementation, for point addition and point doubling, respectively, whereas our pipeline folding results in one FPGA chip design.…”

Section: Results (mentioning)

confidence: 80%

“…Virtex-4 delay of scalar point multiplication based on Virtex-II Pro FPGA. We compared our 256-bit point addition results to a multiprocessor implementation based on a 32-node Beowulf cluster [9]. The results shown in Table 5 also include their Virtex-II implementation which uses 144 18-bit embedded multipliers.…”

Section: Results (mentioning)

confidence: 99%

“…
Cluster [9]: 196.72 µs
Embedded Multiplier [9]: 24.56 µs
Folded Pipeline (in our case): 17.512 µs

Other FPGA designs of the elliptic cryptosystem architectures are in [6,8]. Comparison with [6] is difficult because they use the embedded multipliers in a Spartan FPGA, whereas we use the older Virtex-II and do not use any embedded multipliers.…”

Section: -Node Beowulf (mentioning)

confidence: 99%
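The speedup figures quoted in the first statement (11.2 and 1.4) can be cross-checked against the three delays quoted above. The following is a minimal sketch of that arithmetic only; the dictionary keys and the `speedup` helper are illustrative names, not taken from the cited paper.

```python
# Delays in microseconds, as quoted in the citation statement above.
# The key names are hypothetical labels chosen for this sketch.
delays_us = {
    "cluster": 196.72,
    "embedded_multiplier": 24.56,
    "folded_pipeline": 17.512,
}


def speedup(baseline_us: float, improved_us: float) -> float:
    """Return how many times faster the improved design is than the baseline."""
    return baseline_us / improved_us


vs_cluster = speedup(delays_us["cluster"], delays_us["folded_pipeline"])
vs_embedded = speedup(delays_us["embedded_multiplier"], delays_us["folded_pipeline"])

print(f"speedup vs. cluster: {vs_cluster:.1f}")
print(f"speedup vs. embedded multiplier: {vs_embedded:.1f}")
```

Both ratios round to the values reported in the quoted text, which suggests the three delays are listed in the same order as the three column labels.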


An Elliptic Curve Cryptosystem Design Based on FPGA Pipeline Folding

Al-Khaleel

Papachristou

Wolff

et al. 2007

13th IEEE International on-Line Testing Symposium (IOLTS 2007)


“…References [7]-[13] have more detailed information about large width multipliers; further references are mentioned as we go.…”

Section: Architectures Considered (mentioning)

confidence: 99%

A Study of Large Width Unsigned Multipliers on FPGAs

Sayed¹,

Aly²

2013

IJCEE

Abstract: Multiplication is a fundamental operation in most signal and image processing applications. High-definition image processing has put a huge demand on fast and massive data processing, and the shrinking CMOS process has made silicon real estate available for such massive data processing building blocks. We compare large-width multipliers in terms of architecture, maximum clock frequency, latency, throughput, resource usage, and power consumption. We use a flopped combinational baseline multiplier for our comparison, and we use the same FPGA platform to keep the analysis fair. We mention some remarks and conclude that shift and add is the best.

Index Terms: Computer arithmetic, FPGA, low power, multiplier, Verilog.
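The abstract above singles out shift and add as the preferred architecture. As a rough software illustration of that technique (not the authors' Verilog design), the sketch below multiplies two unsigned integers by examining one multiplier bit per iteration; the function name and the `width` parameter are assumptions made for this example.

```python
def shift_add_multiply(a: int, b: int, width: int) -> int:
    """Multiply two unsigned width-bit integers by shift and add.

    Each loop iteration mirrors one step of a sequential hardware
    multiplier: test one bit of b and, if it is set, add the
    correspondingly shifted copy of a to the running product.
    """
    limit = 1 << width
    if not (0 <= a < limit and 0 <= b < limit):
        raise ValueError("operands must be unsigned and fit in width bits")
    product = 0
    for i in range(width):
        if (b >> i) & 1:
            product += a << i
    return product


# Example with 256-bit operands, the width discussed in the statements above.
x = (1 << 255) + 12345
y = (1 << 200) + 67890
print(shift_add_multiply(x, y, 256) == x * y)
```

In hardware, a design of this kind trades latency (one partial product per clock cycle) for a small adder and register footprint, which is one plausible reading of why the abstract favors it for large widths.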

“…While increasing m, implementations become very time and resource consuming. Most of the known architectures accelerate the multiplication process by modifying the elliptic equations through a change of the Z coordinate term [5], by multiplication scalability [6], by using many serial and parallel arithmetic units [7], by using a highly parallel Karatsuba multiplier [8], by using Massey-Omura multipliers, by a hybrid multiplier approach, by other parallel approaches, by the new word-level structure, through the systolic architecture, by using the half-and-add method, or by parallelizing both the add and double Montgomery algorithms [9].…”

Section: Hardware Design (mentioning)

confidence: 99%

Reconfigurable Architecture for Network processing

Rahuman¹,

Athisha²

2012

IJCA

The performance of an elliptic curve (EC) cryptosystem depends on efficient arithmetic in the underlying finite field. We propose and compare designs over two Galois fields, GF(2^163) and GF(2^193). The proposed GF(2^163) architecture is based on the López-Dahab elliptic curve point multiplication algorithm, which uses a Gaussian normal basis for the field arithmetic. The derived parallelized elliptic curve point doubling and addition algorithms with uniform addressing are based on the López-Dahab method. The proposed GF(2^193) architecture is based on an efficient Montgomery add and double algorithm; the Karatsuba-Ofman multiplier and the Itoh-Tsujii algorithm are used for the inversion component. The hardware design is based on an optimized finite state machine (FSM), with a single-cycle 193-bit multiplier, a field adder, and a field squarer. The optimizations at the hardware level accelerate ECC scalar multiplication and increase the frequency and speed of operations such as key generation, encryption, and decryption. Finally, we implement our design on a Xilinx XC4VLX200 FPGA device.
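The abstract above names the Karatsuba-Ofman multiplier for binary-field arithmetic. The sketch below shows the general idea for polynomials over GF(2) packed into Python integers, where addition is XOR. It is an illustration under stated assumptions, not the cited hardware design: the function names and the 8-bit recursion cutoff are hypothetical, and the reduction modulo the field polynomial is deliberately omitted.

```python
import random


def clmul(a: int, b: int) -> int:
    """Schoolbook carry-less multiply of two GF(2) polynomials packed in ints."""
    result = 0
    while b:
        if b & 1:
            result ^= a  # addition in GF(2) is XOR
        a <<= 1
        b >>= 1
    return result


def karatsuba_gf2(a: int, b: int, bits: int) -> int:
    """Karatsuba-Ofman product of two polynomials of fewer than `bits` terms.

    Uses three half-size multiplications instead of four. No modular
    reduction is performed, so the result has up to 2*bits - 1 terms.
    """
    if bits <= 8:  # small operands: fall back to the schoolbook method
        return clmul(a, b)
    half = (bits + 1) // 2
    mask = (1 << half) - 1
    a_lo, a_hi = a & mask, a >> half
    b_lo, b_hi = b & mask, b >> half
    lo = karatsuba_gf2(a_lo, b_lo, half)
    hi = karatsuba_gf2(a_hi, b_hi, bits - half)
    # (a_lo + a_hi)(b_lo + b_hi) minus lo and hi leaves the cross terms.
    mid = karatsuba_gf2(a_lo ^ a_hi, b_lo ^ b_hi, half) ^ lo ^ hi
    return lo ^ (mid << half) ^ (hi << (2 * half))


# Compare against the schoolbook product for random 193-bit operands.
rng = random.Random(0)
p = rng.getrandbits(193)
q = rng.getrandbits(193)
print(karatsuba_gf2(p, q, 193) == clmul(p, q))
```

A complete GF(2^193) multiplier would follow this product with reduction by the chosen irreducible polynomial, which the abstract does not specify.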
