2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA) 2017
DOI: 10.1109/icmla.2017.00-92
|View full text |Cite
|
Sign up to set email alerts
|

Predictive Models of Hard Drive Failures Based on Operational Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 32 publications
(21 citation statements)
references
References 11 publications
0
20
0
Order By: Relevance
“…Many researchers and industrial partners have also focused their efforts on analyzing real-world datasets. For example, there are plenty of works in the literature that concentrated on using Backblaze's public dataset to extract some useful models [21]- [28]. Other works such as [29] base their analysis (latent sector error analysis for reliability in this case) on proprietary data collected from production-ready storage systems.…”
Section: A Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Many researchers and industrial partners have also focused their efforts on analyzing real-world datasets. For example, there are plenty of works in the literature that concentrated on using Backblaze's public dataset to extract some useful models [21]- [28]. Other works such as [29] base their analysis (latent sector error analysis for reliability in this case) on proprietary data collected from production-ready storage systems.…”
Section: A Related Workmentioning
confidence: 99%
“…A case study for predicting hard disk's time-to-failure using regression analysis is given by [24]. The authors in [21] have used various machine learning classification methods to predict hard drive failures. On the other hand, [22] studies statistical techniques to predict disk replacement with 98% accuracy using 30, 000 drives of Backblaze's dataset.…”
Section: A Related Workmentioning
confidence: 99%
“…In [10], the authors came up with disk replacement prediction algorithm with changepoint detection in time series Backblaze data and concluded some rules for directly identifying the state of a device: healthy or faulty. Aussel et al, [6] used the same dataset to perform hard drive failure prediction with SVM, RF and GBT and discussed their performances based on precision and recall. Prediction of remaining useful lives using quantum particle swarm optimization [11] [12] of lithium-ion battery has been discussed in [13] and a host of recent swarm intelligence algorithms [14] can be effectively applied in prediction of RUL of various devices in conjuction with other ML approaches.…”
Section: B Related Workmentioning
confidence: 99%
“…But as this does not indicate the actual efficiency of the prediction model, this cannot be used in a real life scenario. In [6] the authors claimed that the best performance on Backblaze dataset was shown by Random Forest (RF). The Precision and Recall values based on the threshold of device failure within 10 days were recorded as almost 0.93 and 0.6 dividing the dataset into training and test sets using cross-validation techniques, whereas we obtained an average precision of 0.84 and recall of 0.72 using the decision threshold of device failure within 10 days without using any future information in the simulation process.…”
Section: Comparative Analyses With Existing Researchmentioning
confidence: 99%
See 1 more Smart Citation