2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN) 2019
DOI: 10.1109/dsn.2019.00017
|View full text |Cite
|
Sign up to set email alerts
|

Understanding and Modeling On-Die Error Correction in Modern DRAM: An Experimental Study Using Real Devices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
80
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 48 publications
(82 citation statements)
references
References 96 publications
2
80
0
Order By: Relevance
“…What sets DRAM disturbance errors apart from other technologies' disturbance errors is that 1) DRAM is exposed to the user-level programs and manipulated directly by a program's load and store instructions (which we do not anticipate to change any time soon, since direct data manipulation in main memory is a fundamental component of programming languages and systems), and 2) in modern DRAM, as opposed to other technologies, strong error correction mechanisms are not commonly employed (either in the memory controller or the memory chip). The success of DRAM scaling until recently has not relied on a memory controller that corrects errors (other than performing periodic refresh and more recently employing very simple single-error correcting codes [9,113,185,186,191,195]). Instead, DRAM chips were implicitly assumed to be error-free and did not require the help of the controller to operate correctly.…”
Section: H Rowhammer In a Broader Contextmentioning
confidence: 99%
“…What sets DRAM disturbance errors apart from other technologies' disturbance errors is that 1) DRAM is exposed to the user-level programs and manipulated directly by a program's load and store instructions (which we do not anticipate to change any time soon, since direct data manipulation in main memory is a fundamental component of programming languages and systems), and 2) in modern DRAM, as opposed to other technologies, strong error correction mechanisms are not commonly employed (either in the memory controller or the memory chip). The success of DRAM scaling until recently has not relied on a memory controller that corrects errors (other than performing periodic refresh and more recently employing very simple single-error correcting codes [9,113,185,186,191,195]). Instead, DRAM chips were implicitly assumed to be error-free and did not require the help of the controller to operate correctly.…”
Section: H Rowhammer In a Broader Contextmentioning
confidence: 99%
“…As there is a large number of rows in the chip, the memory controller issues a refresh command once every 7.8 us to complete the refresh cycle for the entire DRAM in 64 ms. A single refresh command typically refreshes multiple rows in a batch in hundreds of nanoseconds. 2 DRAM refresh consumes a significant amount of energy and its overhead is expected to further increase in future DRAM devices as DRAM capacity increases [3,9,13,24,46,48,49,51,72,73,81,83,87,[92][93][94]114]. For example, Liu et al [72] show that refreshes constitute 15% of the total DRAM energy for a 4Gb DDR3 chip, and the fraction of DRAM energy spent on DRAM refresh is projected to increase as DRAM chips become denser (e.g., refreshes would consume about 50% of the total DRAM energy in future 64 Gb DRAM chips).…”
Section: Dram Organization and Operationmentioning
confidence: 99%
“…Refresh is a growing major energy and performance bottleneck with the scaling of the DRAM technology [46,72]. RTC mitigates this negative scaling trend [3,9,13,24,46,48,49,51,72,73,81,83,87,[92][93][94]114] for a class of applications by minimizing the need to refresh with its RTT and PAAR techniques. For a 64 Gb DRAM chip, even when working at peak bandwidth, refresh is expected to consume 46% of the total DRAM energy [45,72].…”
Section: Scalability Benefitsmentioning
confidence: 99%
“…Model Selection. EDEN applies a maximum likelihood estimation (MLE) [128] procedure to determine 1) the parameters (P, F A , P B , F B , P W , F W , F V 1 and F V 0 ) of each error model, and 2) the error model that is most likely to produce the errors observed in the real approximate DRAM chip. In case two models have very similar probability of producing the observed errors, our selection mechanism chooses Error Model 0 if possible, or one of the error models randomly otherwise.…”
Section: Enabling Eden With Error Modelsmentioning
confidence: 99%