2021
DOI: 10.1109/jssc.2021.3056447
|View full text |Cite
|
Sign up to set email alerts
|

CAP-RAM: A Charge-Domain In-Memory Computing 6T-SRAM for Accurate and Precision-Programmable CNN Inference

Abstract: A compact, accurate, and bitwidth-programmable in-memory computing (IMC) static random-access memory (SRAM) macro, named CAP-RAM, is presented for energy-efficient convolutional neural network (CNN) inference. It leverages a novel charge-domain multiply-and-accumulate (MAC) mechanism and circuitry to achieve superior linearity under process variations compared to conventional IMC designs. The adopted semi-parallel architecture efficiently stores filters from multiple CNN layers by sharing eight standard 6T SRA… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
28
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 59 publications
(28 citation statements)
references
References 33 publications
(58 reference statements)
0
28
0
Order By: Relevance
“…In cryptographic computing, the requirement on PIM is different from that of machine learning applications because of its zero tolerance to compute errors. Thus, bit-parallel and bitserial operations are more suitable than the lossy computing mechanisms in current, charge, or voltage domains [17], [18], [19], [20]. Recent Digital in-SRAM architectures [21], [22] have been designed to compute with full precision and high parallelism.…”
Section: Processing In Memory For Cryptographic Accelerationmentioning
confidence: 99%
“…In cryptographic computing, the requirement on PIM is different from that of machine learning applications because of its zero tolerance to compute errors. Thus, bit-parallel and bitserial operations are more suitable than the lossy computing mechanisms in current, charge, or voltage domains [17], [18], [19], [20]. Recent Digital in-SRAM architectures [21], [22] have been designed to compute with full precision and high parallelism.…”
Section: Processing In Memory For Cryptographic Accelerationmentioning
confidence: 99%
“…Early IMC attempts [31] employed few Analog to Digital Converters(ADCs) multiplexed across many MACs [17], [22], [31]. The number of ADCs in more recent approaches is higher [24], [27], [42], aiming to maximise speed through parallelism in the computation. As a result of memory folding we get that bit-lines(BL) are multiplexed to data-line(DL), making it so that one MAC is generated per DL at any given time.…”
Section: Introductionmentioning
confidence: 99%
“…Many studies have implemented PIM based on static random-access memory (SRAM) due to its logic compatibility and high operation speed [1][2][3][4][5][6][7]. However, SRAM-based PIMs have the limitations of low bit density and large silicon area [1,2].…”
Section: Introductionmentioning
confidence: 99%
“…Previous eDRAM structures extended the tention time by employing an additional capacitor in the gain cell. However, the multip accumulate (MAC) operation in an analog PIM usually requires metal-oxide-me (MOM) coupling capacitors [3][4][5], and a sufficiently large capacitor cannot be employ in the gain cell because of the area constraint. In addition, for the same gain cell archit ture, process scaling to the ultra-deep submicron scale further reduces the retention tim As shown in Figure 2, for the same two-transistor (2T) gain cell structure [20,21], simulated retention time decreases by approximately 300 times as the channel length creases from 180 to 28 nm owing to an increased leakage current and a reduced paras capacitance.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation