Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2021
DOI: 10.48550/arxiv.2104.15052
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DRAM Failure Prediction in AIOps: Empirical Evaluation, Challenges and Opportunities

Zhiyue Wu,
Hongzuo Xu,
Guansong Pang
et al.

Abstract: DRAM failure prediction is a vital task in AIOps, which is crucial to maintain the reliability and sustainable service of large-scale data centers. However, limited work has been done on DRAM failure prediction mainly due to the lack of public available datasets. This paper presents a comprehensive empirical evaluation of diverse machine learning techniques for DRAM failure prediction using a large-scale multisource dataset, including more than three millions of records of kernel, address, and mcelog data, pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…Over recent decades, with the burgeoning of informatization, a substantial amount of time series data have been continuously created. As the functioning status of various target systems such as large-scale data centers [49], cloud servers [19], space crafts [20], and even human bodies [30], these time series data are a source where we can monitor and alarm potential faults, threats, and risks of target systems by identifying their unusual states (i.e., anomalies). Anomaly detection, an important field in data mining and analytics, is to find exceptional data observations that deviate significantly from the majority [34], which is playing a critical role in achieving this goal.…”
Section: Introductionmentioning
confidence: 99%
“…Over recent decades, with the burgeoning of informatization, a substantial amount of time series data have been continuously created. As the functioning status of various target systems such as large-scale data centers [49], cloud servers [19], space crafts [20], and even human bodies [30], these time series data are a source where we can monitor and alarm potential faults, threats, and risks of target systems by identifying their unusual states (i.e., anomalies). Anomaly detection, an important field in data mining and analytics, is to find exceptional data observations that deviate significantly from the majority [34], which is playing a critical role in achieving this goal.…”
Section: Introductionmentioning
confidence: 99%