2014
DOI: 10.1109/tc.2012.218

Scalable Montgomery Modular Multiplication Architecture with Low-Latency and Low-Memory Bandwidth Requirement

Cited by 17 publications (4 citation statements)
References 16 publications
“…Ref. [25] reorganizes the operands on the basis of [19] to achieve low memory bandwidth and high frequency while keeping the number of iterations unchanged, but its delay chain under high input bit width contains two-stage multiplication and addition modules, so the balance between frequency and total number of cycles cannot be achieved. Although [11] also uses the full-carry-save method, it uses CPA to complete the data conversion in the iterative process, thus reducing the overall running frequency.…”
Section: Results and Comparisons (mentioning)
confidence: 99%
“…In the recent study, the dependency graph and multiple process elements (PEs) are the research hotspots in the Montgomery modular multiplication algorithm. According to the algorithm, Lin et al [5,13,14] proposed a hardware architecture consisting of multiple PEs to work in parallel for reducing the delay and the memory bandwidth requirement, and achieving higher throughput. Renardy et al [15] designed an iterative modular architecture on FPGA and achieved better 2 AT (area delay).…”
Section: Related Work (mentioning)
confidence: 99%
“…The multiplier component implemented by them have increased the throughput by using single cell that comprised of digital multiplier and adder circuits. In (Lin, Ye, & Shieh, 2014) a new technique that relaxed data dependency in load-based algorithms and tried to reuse the referred word of a variable to realize the implementation of numerous Montgomery modular multiplication algorithms namely high-radix algorithm. It claims 54% cut down of power utility without any degradation in speed.…”
Section: Montgomery Multiplication (mentioning)
confidence: 99%
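
The citation statements above refer to word-based, high-radix Montgomery modular multiplication without showing the operation itself. As a point of reference, the following is a minimal software sketch of the textbook word-serial algorithm. It is not the hardware architecture of the cited paper, and it does not model processing elements, carry-save arithmetic, or memory bandwidth. The function name `montgomery_multiply` and the parameter `word_bits` are illustrative assumptions, and the sketch assumes Python 3.8 or later for the modular inverse via `pow`.

```python
def montgomery_multiply(a, b, n, word_bits=16):
    """Return a * b * R^-1 mod n, where R = 2**(word_bits * num_words).

    Illustrative sketch only: assumes n is odd and 0 <= a, b < n.
    """
    if n % 2 == 0:
        raise ValueError("modulus must be odd")
    radix = 1 << word_bits
    mask = radix - 1
    # Number of word_bits-wide words needed to hold the modulus.
    num_words = (n.bit_length() + word_bits - 1) // word_bits
    # n_prime = -n^-1 mod 2^word_bits, precomputed once per modulus.
    n_prime = (-pow(n, -1, radix)) % radix

    acc = 0
    for i in range(num_words):
        # Scan one word of the multiplier a per iteration (high radix).
        a_i = (a >> (i * word_bits)) & mask
        acc += a_i * b
        # Choose q so that the lowest word of acc + q * n is zero.
        q = ((acc & mask) * n_prime) & mask
        # The division by the radix is exact, so a shift suffices.
        acc = (acc + q * n) >> word_bits

    # The loop keeps acc below 2n, so one conditional subtraction is enough.
    if acc >= n:
        acc -= n
    return acc
```

To multiply ordinary residues, operands are first mapped into the Montgomery domain as `x * R mod n`, multiplied there, and mapped back by a final call with `1` as the second operand.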