2012
DOI: 10.1145/2366231.2337202
|View full text |Cite
|
Sign up to set email alerts
|

A case for exploiting subarray-level parallelism (SALP) in DRAM

Abstract: Modern DRAMs have multiple banks to serve multiple memory requests in parallel. However, when two requests go to the same bank, they have to be served serially, exacerbating the high latency of off-chip memory. Adding more banks to the system to mitigate this problem incurs high system cost. Our goal in this work is to achieve the benefits of increasing the number of banks with a low cost approach. To this end, we propose three new mechanisms that overlap the latencies of different requests that go to the same… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
223
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 148 publications
(224 citation statements)
references
References 42 publications
1
223
0
Order By: Relevance
“…We use PinPoints [58] to obtain the representative phases of each application. Our simulation executes at least 200 million instructions on each core [9,16,35,38]. Performance Metric.…”
Section: Evaluation Methodologymentioning
confidence: 99%
See 1 more Smart Citation
“…We use PinPoints [58] to obtain the representative phases of each application. Our simulation executes at least 200 million instructions on each core [9,16,35,38]. Performance Metric.…”
Section: Evaluation Methodologymentioning
confidence: 99%
“…Several types of commodity DRAM (Micron's RLDRAM [52] and Fujitsu's FCRAM [62]) provide low latency at the cost of high area overhead [35,38]. Many prior works (e.g., [8,9,17,35,38,45,56,65,66,70,84]) propose various architectural changes within DRAM chips to reduce latency. In contrast, FLY-DRAM does not require any changes to a DRAM chip.…”
Section: Related Workmentioning
confidence: 99%
“…These techniques can be applied to the custom DRAM technology in SILO to increase vault capacities without compromising the access latency. Other techniques target reducing DRAM latency by overlapping accesses to different subarrays [60] and improving row-buffer locality by exploiting access patterns [61]- [64]. While these techniques allow overlapping access latencies of different requests, they do not reduce the actual access latency.…”
Section: Dram Latency Optimizationmentioning
confidence: 99%
“…The subarray controller consists of address latches, local decoders, and counters. The address latches are essential for multisubarray activation [54]. The counters are used for continuously updating addresses to local decoders for the bulk-style µ-operations.…”
Section: Microarchitecture For Controllersmentioning
confidence: 99%
“…To achieve this, each subarray and bank has their independent controllers with latches. Previous work [54] shows such modication incurs ignorable area overhead. The detailed controller design is shown in Section 4.3.…”
Section: Optimizing Bank Reorganizationmentioning
confidence: 99%