2019
DOI: 10.1109/jxcdc.2019.2955016
|View full text |Cite
|
Sign up to set email alerts
|

Energy and Performance Benchmarking of a Domain Wall-Magnetic Tunnel Junction Multibit Adder

Abstract: The domain-wall (DW)-magnetic tunnel junction (MTJ) device implements universal Boolean logic in a manner that is naturally compact and cascadable. However, an evaluation of the energy efficiency of this emerging technology for standard logic applications is still lacking. In this article, we use a previously developed compact model to construct and benchmark a 32-bit adder entirely from DW-MTJ devices that communicates with DW-MTJ registers. The results of this large-scale design and simulation indicate that … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
10
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 47 publications
1
10
0
Order By: Relevance
“…As depicted in Fig. 6, a 4:16 triple-row decoder can be designed by interleaving four 2:4 dynamic N AN D decoders 5 . Since single-row decoding must co-exist with triple-row decoding, an address translator circuit is used to switch between the two modes using M AJ as a control signal.…”
Section: A Functional Completenessmentioning
confidence: 99%
See 1 more Smart Citation
“…As depicted in Fig. 6, a 4:16 triple-row decoder can be designed by interleaving four 2:4 dynamic N AN D decoders 5 . Since single-row decoding must co-exist with triple-row decoding, an address translator circuit is used to switch between the two modes using M AJ as a control signal.…”
Section: A Functional Completenessmentioning
confidence: 99%
“…As a quantitative example, [3] points out that the energy for DRAM access is 3556× the energy for 16-bit addition in 45 nm CMOS technology. Similarly, DRAM access latency is ≈ 100 ns [4], while latency of 32-bit adder is 4 ns in CMOS technology [5], implying that data movement latency forms a significant portion of computation latency in conventional von-Neumann computing model. Consequently, there had been many efforts in the last 10-15 years to combat the memory wall by bringing the processor and memory unit closer together.…”
Section: Introductionmentioning
confidence: 99%
“…IV. RESULTS The network was initially simulated in Cadence using models that followed DW-MTJ behavior [36]- [38]. Larger network sizes took a prohibitively long time to simulate, so a specialized simulation program was created which was able to perform the calculations significantly faster.…”
Section: Parameters 1) Mtj Parametersmentioning
confidence: 99%
“…As noted in Tables 3 and 4, in-memory adders require tens of steps for addition operations. Even if a single step takes 5 ns (RRAMs can switch in a few ns), this would be much larger than the latency incurred in CMOS technology (32-bit addition operation can be performed in 4 ns in CMOS technology [62]). However, in in-memory arithmetic, the energy and latency (hundreds of ns) for data movement is avoided (the numbers to be added have to be moved from DRAM memory to processor in conventional approach).…”
Section: Xor B 0 a 0 B 1 A 1 B 2 A 2 B 3 A 3 B 4 A 4 B 5 A 5 B 6 A 6 mentioning
confidence: 99%