2016
DOI: 10.1145/2992782
|View full text |Cite
|
Sign up to set email alerts
|

Reliability Analysis of SSDs Under Power Fault

Abstract: Modern storage technology (solid-state disks (SSDs), NoSQL databases, commoditized RAID hardware, etc.) brings new reliability challenges to the already-complicated storage stack. Among other things, the behavior of these new components during power faults—which happen relatively frequently in data centers—is an important yet mostly ignored issue in this dependability-critical area. Understanding how new storage components behave under power fault is the first step towards designing new robust storage systems.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 32 publications
0
9
0
Order By: Relevance
“…We have built a preliminary prototype of X-Ray and applied it to diagnose 5 failure cases based on real bugs from the literature [101,70,76,63]. We discuss one case in details and summarize the results at the end.…”
Section: Preliminary Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We have built a preliminary prototype of X-Ray and applied it to diagnose 5 failure cases based on real bugs from the literature [101,70,76,63]. We discuss one case in details and summarize the results at the end.…”
Section: Preliminary Resultsmentioning
confidence: 99%
“…Their initial experiments were performed on Linux kernel v2.6.32, and eight out of fifteen SSDs exhibited a symptom called "serialization errors" [100]. However, in their follow-up work where similar experiments were conducted on a newer kernel (v3.16.0) [101], the authors observed that the failure symptoms on some SSDs changed significantly (see Table 1, adapted from [101]). It was eventually confirmed that the different symptoms was caused by a syncrelated Linux kernel bug [101].…”
Section: Motivationmentioning
confidence: 99%
“…References [5,[8][9][10][11][12][13] studied the failure trend in flash chips. Other studies analyzed different factors in flash failures, such as read disturb [14][15][16][17], program disturb [9,18], data retention [10,19] and power faults [20,21]. Additionally, there are studies examining the influence of different factors on the error rate of flash chips [13].…”
Section: Related Workmentioning
confidence: 99%
“…As shown in Table 4, each record contained a checksum to detect corruption within the record. Furthermore, the record included self-identification information (e.g., the expected range and LBA) to allow detection of misplacement of the record (e.g., flying writes [21,22]), which may pass the examination of the checksum. The other fields (e.g., size and rand) allowed us to regenerate the exact record for verification uniquely.…”
Section: Location Of Writementioning
confidence: 99%