2016 Annual Reliability and Maintainability Symposium (RAMS) 2016
DOI: 10.1109/rams.2016.7448033
|View full text |Cite
|
Sign up to set email alerts
|

Predicting hardware failure using machine learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 26 publications
(10 citation statements)
references
References 3 publications
0
10
0
Order By: Relevance
“…However, sequential data was not used, and thus, the likely interdependence of features over time was not considered. Other authors predict the time until hardware components failed [15]. Furthermore, in previous research, we showed that LSTM outperformed other algorithms like FCN or residual neural network when applied to time series data, however, struggled with data imbalance [16].…”
Section: Introductionmentioning
confidence: 64%
“…However, sequential data was not used, and thus, the likely interdependence of features over time was not considered. Other authors predict the time until hardware components failed [15]. Furthermore, in previous research, we showed that LSTM outperformed other algorithms like FCN or residual neural network when applied to time series data, however, struggled with data imbalance [16].…”
Section: Introductionmentioning
confidence: 64%
“…The authors in [19] have considered each of the devices and their components separately, labelling them as faulty or not. Thibaux et al [5] decided to distinguish between three classes: "impending failure detected", "not impending failure detected" and "uncertain about future failure". In the case of this work, it was decided to consider the issue as a multiclass classification problem where it will be anticipated which of the four components will fail or none.…”
Section: Data Structure and Preprocessingmentioning
confidence: 99%
“…However, very few work has attempted to fully analyze and predict high performance cloud system data empirically using a failure-in-production real-time data.The authors in [18] have made a good attempt to analyse the failure data of a large-scale production Cloud environment consisting of over 12,500 servers, which includes a study of failure and repair times and characteristics for both Cloud workloads and servers, but they never looked at the failure correlation between workload intensity and size of the system respectively. The author in [19] developed a machine learning approach for predicting individual component times until failure which they reported it as far more accurate than the traditional MTBF approach. Their algorithm was built to be able to monitor the health of 14 hardware samples and notify them of an impending failure well ahead of actual failure, providing adequate time to fix the problem before actual failure occurred.…”
Section: Related Workmentioning
confidence: 99%