2018
DOI: 10.1109/tcad.2018.2857044
|View full text |Cite
|
Sign up to set email alerts
|

McDRAM: Low Latency and Energy-Efficient Matrix Computations in DRAM

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
66
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 65 publications
(66 citation statements)
references
References 13 publications
0
66
0
Order By: Relevance
“…We synthesized both the NDR and IDR units using the Synopsys Design Compiler with 40nm CMOS technology at a frequency of 300 and 200 MHz and scaled the result of IDR to a 20nm DRAM process referring to [5], [12]. The total area overhead of IDRs is 0.903 mm 2 per DRAM chip, assuming that each chip has ×8 data I/O bit-width.…”
Section: Design Overheadmentioning
confidence: 99%
“…We synthesized both the NDR and IDR units using the Synopsys Design Compiler with 40nm CMOS technology at a frequency of 300 and 200 MHz and scaled the result of IDR to a 20nm DRAM process referring to [5], [12]. The total area overhead of IDRs is 0.903 mm 2 per DRAM chip, assuming that each chip has ×8 data I/O bit-width.…”
Section: Design Overheadmentioning
confidence: 99%
“…If a memory controller receives the standard DRAM memory requests while a PIM operation is in progress, the controller should service the requests as soon as possible to satisfy their performance requirement. However, the previous PIM research did not provide the solutions, and instead, the standard memory requests are assumed to be not received when the PIM operation is in progress [19], [21]- [23], [42]. However, their cooperation should be studied.…”
Section: B Cooperating With Standard Memory Requestsmentioning
confidence: 99%
“…One of the most critical issues in implementing PIM is the area overhead of MAC ALUs. McDRAM [42] analyzed the overhead when designing MAC ALUs for BLSAs, column decoders, and I/O drivers in DRAM. Also, [42] showed that the design of 256 8-bit MACs at the column decoder positions incurs the area overhead of about 4.7% of the total DRAM area.…”
Section: Pim Datapath and Commandsmentioning
confidence: 99%
See 2 more Smart Citations